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The  goal  of  this  program  of  research  has  been  to  specify  auditory  processing  in 
the  presence  of  noisy  backgrounds.  A  variety  of  experimental  and  modeling  approaches 
have  been  employed  to  examine  this  processing.  Overall  the  results  suggest  the 
Importance  of  spectraUand  temporal  comparisons  in  signal  detection  and  suggest  that 
slmflar  processing  underlies  monaural  and  binaural  detection.  The  introduction  of  masker 
energy  in  temporal  Intervals  that  did  not  overlap  with  the  signal  could  be  shown  to< 
either  enhance  or  degrade  detection  performance,  depending  on  the  intecaural  parameters 
of  the  stimuli.  Experiments  on  remote  masking  and  suppression  showed  excitatory  and 
inhibitory  effects  that  extended  across  more  than  an  octave.  These  results  are  beiiig 
used  to  evaluate  a  nonlinear  model  of  cochlear  processings  The  responses  of  subjects  'l 
to  individual  stimuli  (reproducible  noise  samples)  were  highly  correlated  between  monaural 
and  binaural  conditions  that  had  seemed  dramatically  different  when  the  ensemble 
performance  was  considered  (l«e.,  data  averaged  across  noise  samples).  These  results 


xt.  ASSTMcr  ucunrv  OAWKAnoN 

cIolsst-Ri-iQ.-  - - 

72a,  NAAM  Of  MSfOMtWU  MOIVRMJAi. 

John  F.  Tangney 

XXb.  TSUfMONC  flKAMb  AaaA  CmMI  XXC.  Of flCC  SVMbOV 

(202)  767-5021  NL 

OOFormU?].  JUN  M  ,  ticuwTV  ftAiwtCAnoH  Of  tftiiL 

-  RON  CLASSIFIED 


08  1  13  023 


2  5  NOV  1991 


R.H.  Gflkey,  566-86-7642,  Page  2 
Final  Report 
AFOSR  89-0302 

April  1,  1989  to  January  31,  1991 

I.  SUMMARY 

The  goal  of  this  project  has  been  to  specify  the  t'ansformations  used  by  the  auditory  system  in  order  to 
determine  the  presence  of  the  signal  in  an  auditory  masking  task,  with  particular  emphasis  on  the  role  of 
processes  that  compare  information  in  the  frequency  domain  and  in  the  time  domain.  Studies  of  binaural 
masking  show  that  masking  noise  that  does  not  overlap  with  the  signal  in  time  can  either  improve  or  degrade 
the  detectability  of  the  signal,  depending  on  the  interaural  phase  relations  among  the  masker,  the  fringe,  and 
the  signal.  The  results  from  studies  of  monaural  pure  tone  masking  and  suppression  are  being  used  to 
evaluate  a  non-linear  model  of  cochlear  processing  that  may  play  a  signiPicant  role  in  the  spectral  comparison 
process.  Studies  that  examine  the  responses  of  subjects  to  each  individual  noise  alone  and  signal-plus-noise 
stimulus  (Molecular  Psychophysics)  suggest  that  similar  cues  determine  performance  m  monaural  and 
binaural  masking  tasks,  a  result  not  predicted  by  many  models  of  binaural  processing. 

II.  RESEARCH  OBJECTIVES 

The  overall  goal  of  this  program  of  research  is  to  specify  the  processes  used  by  the  auditory  system  to 
detect  signals  presented  in  noisy  backgrounds.  It  is  assumed  that  the  behavior  of  the  subject  can  be  modeled 
by  a  system  that  on  each  trial  computes  a  single  quantity,  the  "decision  variable*  of  the  model,  which  in  the 
manner  described  by  the  Theory  of  Signal  Detectability  provides  the  basis  of  the  subject’s  decision  about  the 
presence  or  absence  of  the  signal.  Within  this  framework  our  task  is  to  determine  the  decision  variable  of 
the  subject.  For  the  tone-in-noise  detection  task  we  have  been  investigating,  classical  models  argue  that  the 
decision  variable  is  based  on  processing  within  the  narrow  frequency  band  centered  around  the  signal  (i.e., 
the  critical  band)  and  within  the  brief  temporal  window  that  contains  the  signal.  We  have  used  a  variety  of 
approaches  to  demonstrate  that  these  classical  models  are  oversimplifications,  to  develop  models  that  provide  a 
more  accurate  description  of  the  responses  of  the  subject,  and  to  delineate  the  relation  between  the 
mechanisms  underlying  monaural  and  binaural  masking. 

III.  STATUS  OF  THE  RESEARCH 

Additional  support  for  this  research  has  been  provided  by  a  grant,  NIH  (DC-00786)  'Monaural 
masking,  binaural  masking  and  their  interrelations,"  period  of  support  May  1,  1990  through  April  30,  1994, 
R.H.  Gilkey,  PI. 

Molar  paychophysknl  analysis  of  modcb  of  masking 

Binaural  temporal  masking.  Because  of  the  binaural  Masking  Level  Difference  (MLD),  if  the 
interaural  phase  of  a  noise  masker  is  switched  during  the  observation  interval  from  in  phase  (NO)  to  180* 
out  of  phase  (Nn)  or  from  Nn  to  NO,  a  brief  interauraliy  out-of-phase  signal  (Sll)  will  be  about  15  dB  more 
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detectable  in  the  NO  portion  of  the  noise  than  in  the  Nr  portion.  By  investigating  the  change  in  detectability 
as  a  function  of  the  delay  (At)  between  the  onset  of  the  signal  and  the  phase  transition  in  the  noise,  the 
temporal  response  of  the  binaural  system  can  be  evaluated.  The  results  of  this  case  can  be  contrasted  widi  a 
set  of  conditions  in  which  the  interaurai  phase  of  the  noise  is  held  constant  (Nr),  but  die  levd  of  the  noise  is 
reduced  or  increased  by  15  dB  halfway  through  the  observation  interval.  Within  a  model  such  as  the  EC 
model  (N.I.  Durlach,  in  J.V.  Tobias  (Ed.)  Foundations  of  Modem  Auditory  Theory  II.  371*462,  1972),  the 
first  case  produces  a  change  of  level  only  in  the  binaural  channel.  The  second  case  produces  a  change  in  the 
level  in  the  monaural  channel  as  well.  The  curves  that  describe  the  relation  between  threshold  and  At  can  be 
thought  of  as  temporal  masking  functions.  They  show,  like  traditional  temporal  masking  data,  that  the  decay 
of  backward  masking  (cases  where  the  NO  segment  of  the  noise  precedes  an  Nr  segment  or  where  the  lower 
intensity  segment  of  the  noise  precedes  the  higher  intensity  segment)  is  more  rapid  than  6or  forward  masking. 
Double*sided  exponential  integration  windows  have  been  fit  to  the  forward  and  backward  masking  functions. 
The  equivalent  rectangular  duration  of  the  best-fitting  window  under  monaural  conditions  ranges  from  12-26 
ms,  somewhat  larger  than  those  estimated  by  Moore  et  al.  [J.  Acoust.  Soc.  Am.  1102-1116,  1988].  The 
equivalent  rectangular  duration  for  the  binaural  conditions  ranges  from  41-83  ms,  similar  to  estimates  by 
Grantham  and  Wightman  [J.  Acoust.  Soc.  Am.  65:  1509-1517,  1979].  The  observed  dififerences  between 
monaural  and  binaural  conditions  were  taken  as  additional  evidence  that  the  binaural  system  responds 
sluggishly  to  changing  stimulation  [Grantham  and  Wightman,  J.  Acoust.  Soc.  Am.  511-523,  1978].  A 
published  paper  is  included  in  the  appendix  [Kollmeier  and  Gilkey,  J.  Acoust.  Soc.  Am.  87,  1709-1719, 
1990]. 

In  studying  the  effects  of  a  forward  masker  fringe,  Yost  [J.  Acoust.  Soc.  Am.  78:  901-  907,  1985] 
found  that  the  threshold  for  a  brief  Sr  signal  masked  by  a  brief  NO  masking  noise  was  not  changed  when  an 
Nr  forward  masker  fringe  was  added.  This  result  was  somewhat  surprising  in  light  of  results  such  as  those 
of  McFadden  [J.  Acoust.  Soc.  Am.  1414-1419,  1966]  who  showed  that  an  NO  forward  foinge 
substantially  improved  performance  in  an  NOSr  detection  task,  and  concluded  that  the  system  uses  the 
forward  fringe  as  a  diotic  reference  against  which  to  detect  the  dichotic  signal.  If  an  NO  forward  fringe 
provides  a  useful  reference,  it  might  be  expected  that  an  Nr  forward  fringe  would  provide  a  detrimental 
reference.  Yost's  results  also  seemed  to  conflict  with  tiie  interpretations  of  Kollmeier  and  Gilkey  [op.  cit.], 
who  thought  of  the  Nr  fringe  as  a  forward  masker.  One  possibility  was  that  the  function  tliM  relates 
threshold  to  At  fx  the  Nr  forward  fringe  condition  Intersects  with  the  function  that  relates  threshold  to  At 
for  the  poised  muker  condition  at  At  ~  0,  even  though  the  functions  are  different  elsewhere.  To  resolve 
these  questions,  the  detectability  of  an  Sr  tonal  signal  was  investigated  as  a  function  of  At,  in  the  presence  of 
an  NO  ‘’masker’’  that  was  preceded  by  quiet  or  by  an  Nr  ’’forward  fringe,”  and  followed  by  quiet  or  by  an 
NO  or  NR  *bockward  fringe.”  The  results  show  that  the  functions  for  the  Nr  forward  fringe  condition  and 
the  pulsed  masker  oontUtion  are  indeed  different  and  that  foey  do  not  intersect.  Overall,  the  results  foiled  to 
replicate  those  of  Yort,  showing  instead  that  the  presence  of  eitiier  an  NR  forward  fringe  or  an  NR  ba^ward 
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fringe  reduced  deiecttbiUty  for  ail  subjects  under  a  variety  of  conditions.  The  results  are  a  further  indication 
that  the  autUtory  system  uses  information  that  does  not  overlap  with  the  signal  in  the  temporal  domain. 
Subsequent  measurements  indicate  that  the  difference  between  Yost’s  results  and  ours  cannot  be  explained 
based  on  differences  in  psychophysical  procedure,  the  amount  or  type  of  training  received  by  die  subjects,  or 
the  duration  of  the  signal.  A  published  paper  is  included  in  the  appendix  {R.H.  Gilkey,  B.D.  Simpson,  and 
J.M.  Weisenberger,  J.  Acoust.  Soc.  Am.  M,  1323-1332,  1990]. 

McFadden  [J.  Acoust.  Soc.  Am.  83:  1685-1687,  1988]  investigated  the  detectability  of  a  brief  tonal 
signal  in  the  presence  of  a  long  duration  wideband  masking  noise.  While  the  ‘'overshoot”  effect  (E. 

Zwicker,  J.  Acoust.  Soc.  Am.  653-663,  1965]  was  observed  for  diotic  stimuli  (NOSO),  no  overshoot  was 
observed  with  dkhotic  stimuli  (NOSr).  (Overshoot  is  defined  as  the  difference  between  the  threshold  fw  a 
signal  whose  onset  is  near  the  beginning  of  the  masker,  and  die  threshold  for  a  signal  uiiose  oAet  is  near 
the  end  of  the  masker).  Comparable  data  [D.E.  Robinson  and  C.  Trahiotis,  Percept.  Psychophys.  U:  333- 
334,  1972;  C.  Trahiotis,  T.R.  Dolan,  and  T.H.  Miller,  Percept.  Psychophys.  ^2:  335-338,  1972]  indicate 
no  overshoot  under  the  monaural  condition,  but  about  6  dB  of  overshoot  under  die  binaural  condition.  In  a 
recent  experiment  we  found  4-8  dB  of  overshoot  under  bodi  NOSO  and  NOSii  condition  when  a  wideband 
masker  was  used,  but  no  overshoot  under  either  condition  when  a  narrowband  masker  was  used.  McFadden 
had  used  a  750  Hz  signal  frequency,  whereas  most  of  these  studies,  including  ours,  used  a  5(X)  Hz  signal 
frequency.  To  eliminate  the  unlikely  possibility  that  this  small  difference  in  frequency  could  have  produced 
the  observed  discrepancy,  we  conducted  additional  measurements  using  both  500  Hz  and  750  Hz  signals  and 
wideband  maskers.  The  results  showed  no  difference  in  overshoot  for  the  two  signal  frequencies,  but  also 
showed  very  little  overshoot  for  most  subjects  under  either  die  NOSO  or  the  NOSr  condition.  The  total 
pattern  of  results  for  our  two  experiments  indicates  that  there  is  considerable  between  subject  variability  in 
the  observed  overshoot. 

Psychophysical  evaluation  of  a  physioloRically  based  model  of  auditory  processing.  In  the  classical 
literature  bodi  the  masking  and  the  suppression  of  <»e  tone  by  a  second  tone  of  lower  frequency  have  been 
shown  to  be  nonlinear  functions  of  overall  level.  We  have  partially  replicaied  the  experiments  of  Wegei  and 
Lane  [Physioi.  Rev.  23:  226-285,  1924]  on  remote  masking  and  of  Duifiiuis  [J.  Acoust.  Soc.  Am.  914- 
927,  1980]  on  suppression,  using  modem  adaptive  psychophyskai  techniques  and  the  same  subjects  in  both 
experiments.  The  data  are  comparable  to  those  from  earlier  studies  and  agree  with  the  Multiple  Band  Pass 
Nonlinearity  (MBPNL)  model  (Goldstein,  1989,  1990,  op.  cit.).  This  model  is  based  on  current  knowledge 
of  auditory  physiology  and  describes  the  response  of  the  peripheral  auditory  system  at  each  frequency  as  the 
result  of  a  nonlinear  interaction  between  a  linear  lowpass  ('tail”)  filter  and  a  compressive  bant^iass  (*tip”) 
filter.  This  view  suggests  that  bodi  excitatory  and  suppressive  mechanisms  influence  remote  masking.  The 
data  indicated  that  the  rdative  influence  of  these  two  mechanisms  varies  firom  subject  to  subject.  Bstimatrs  of 
the  exponent  of  the  compressive  nonlinearity  of  the  model  obtidaed  from  the  simuHaneous  masking 
experiment  agree  with  dmse  obtained  from  the  suppresskm  and  forward  masking  experiments.  Re- 
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eumination  of  the  simultaneous  maaUng  data  that  Gagn6  [J.  Acoust.  Soc.  Am.  2311-2321,  1988] 
obtained  with  hearing-impaired  subjects,  indicates  that  they  are  also  compatible  with  the  MBPNL  model,  if  it 
is  assumed  that  the  tip  filler  is  damaged  (gain  set  to  zero).  The  results  of  these  experiments  were  presented 
to  the  Acoustical  Society  of  America  [Goldstein.  Oilkey.  and  (^iilteez,  J.  Acoust.  Soc.  Am.,  K,  S24(A). 
1989]. 

Molecular  parchonhyskal  analyses  of  medeii  nf  mashlnt 

In  most  studies  of  auditory  masking,  including  those  described  above,  both  the  stimulus  and  the 
performance  of  the  subjects  are  described  by  their  statistical  properties  (e.g.,  the  average  power  of  the 
stimulus  and  the  average  probability  of  a  correct  response).  The  ouQmts  of  models  are  described  by  their 
distributional  properties  aitd  the  average  performance  of  a  model  is  fit  to  the  average  performance  of  a 
subject.  Another  approach  was  described  by  Green  (Psychol.  Rev.  71:  392-407,  1964)  and  referred  to  as 
'motecular*  psychophysics.  In  this  approach,  rqtroducible  noise  is  used  as  a  masker,  such  diat  die  stimulus 
can  be  specified  exacdy  on  every  trial.  Similarly,  the  responses  of  the  subject  are  considered  on  a  trial-by¬ 
trial  basu.  The  ou^iuts  of  models  are  determined  for  each  stimulus  and  the  fit  of  the  model  is  evaluated  by 
comparing  these  outputs  to  the  associated  responses  of  the  subjects. 

The  relation  between  monaural  and  binaural  masking.  The  large  masking  level  difierence  observed 
between  monaural  and  biiwural  tone-in-noise  masking  tasks  has  been  used  to  suggest  that  quite  different 
processing  is  employed  under  the  two  conditions  (e.g.,  energy  detection  vs.  interaural  time  processing). 
However,  when  Gilkey  et  al.  (J.  Acoust.  Soc.  Am.78:  1207-1219,  1985]  examined  the  reqwnses  of  subjects 
to  individual  wideband  reproducible  noise  samples,  they  found  that  the  responses  under  the  NOSO  and  NOSX 
conditions  were  highly  correlated.  On  the  other  hand,  when  Isabelle  and  Cfolbum  [J.  Acoust.  Soc.  Am.  82: 
109(A),  1987]  examined  the  responses  to  individual  narrowband  reproducible  noise  samples,  they  found 
correlations  that  were  much  weaker  and  often  negative.  They  attributed  the  differences  between  their  d«a 
and  those  of  Gilkey  et  al.  to  the  differences  in  the  bandwidth  of  the  masker.  If  so,  this  would  suggest  that 
the  correlation  observed  by  Gilkey  et  al.  would  more  qipropriately  be  attributed  to  similarities  in  across 
critical  band  processing  rather  than  to  similarities  in  within  critical  band  (wocessing,  as  Gilkey  et  ri.  had 
implied. 

To  investigate  further  the  effect  of  masker  bandwidth,  the  experiment  of  Gilkey  et  al.  was  replicated 
using  both  wideband  (100-3000  Hz)  and  narrowband  (third  octave)  maskers.  Although  the  corrdation 
between  NOSO  and  NOSH  performance  was,  in  general,  somewhat  weaker  foi  the  narrowband  condition  all 
observed  correlations  were  significant  (p<  <  .001),  reaffirming  the  strong  correlation  between  NOSO  and 
NOSr  performance.  Again,  this  result  has  significant  implications  for  models  of  both  monav  i  and  binaural 
performance.  It  is  typically  imomed  that  monaural  performance  is  governed  by  the  energy  in  the  stimulus, 
while  binaural  performance  is  related  to  interaural  differences  in  the  stimulus,  particularly  interaural 
differences  in  time.  The  resuits  of  this  experiment  imply  that  binaural  poformance  mi^  be  based  on  an 
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energy-like  cue  (e.g.,  the  E-C  model),  or  that  monaural  performance  might  be  based  on  a  timing-like  cue 
(e.g.,  the  model  of  Bilsen  and  Goldstein  [J.  Acoust.  Soc.  Am.  292-296,  1974]). 

More  recent  studies  extend  this  approach  to  conditions  employing  interaurally  uncorrelaied  noise.  The 
NOSO  and  NOSlI  conditions  show  very  different  performance  at  the  molar  level,  but  very  similar  performance 
at  the  molecular  level.  We  have  shown  that  when  the  interaural  correlation  of  the  extemai  noise  is  1 .0,  the 
output  of  the  E-C  mechanism  is  highly  correlated  with  the  waveforms  in  the  monaurai  channels.  Said 
differentiy,  the  effective  maskers  under  the  NOSO  and  NOSx  conditions  are  highly  correlated.  Therefore,  the 
observed  correlation  between  the  NOSO  and  the  NOSn  molecular  responses  of  human  subjects  is  expected.  On 
the  other  hand,  it  might  be  expected  that  even  though  the  NOSO  and  NUSO  conditions  yidd  similar 
performance  on  the  molar  level,  quite  different  performance  would  be  seen  when  the  data  are  analyzed  on  the 
molecular  level.  Under  the  NUSO  condition  the  interaural  correlation  of  the  masker  is  zero.  The  E-C  model 
would  argue  that  the  effective  masker  under  these  conditions  is  only  partialiy  correlated  with  ehber  of  the 
maskers  in  the  monaural  channels.  We  collected  molecular  psychoftiiysical  data  under  the  NOSO,  NOSil, 
NUSO,  and  NUSK  conditions.  Under  the  NOSO  condition  performance  was  measured  separately  for  the 
waveforms  reaching  each  ear  under  the  binaural  conditions.  The  strong  correlation  between  NOSO  and 
NOSll  molecular  responses  was  replicated.  As  expected,  the  responses  under  either  the  NUSO  or  the  NUSit 
condition  were  not  well  predicted  by  the  responses  under  the  NOSO  condition  to  either  monaural  masker 
(i.e.,  the  maskers  in  each  ear  under  the  NU  conditions).  However,  to  our  surprise,  when  the  NOSO 
responses  to  the  maskers  in  the  two  ears  were  averaged  and  used  to  predict  the  responses  under  the  NU 
conditions  the  correlations  were  again  quite  strong.  These  results  will  be  compared  to  the  detailed 
predictions  of  the  E-C  model  and  to  the  predictions  of  lateralization  models. 

Improved  molecular  psychophysical  methods.  Although  the  molecular  psychophysical  approach  has 
proven  extremely  useful,  data  collection  is  slow.  The  amount  of  data  needed  b  increased,  over  that  obtained 
in  a  molar  experiment,  by  a  factor  of  approximately  N,  where  N  is  the  number  of  reproducible  noise  samples 
employed,  because  the  approach  requires  us  to  estimate  the  vaiue  of  the  subjects’  decision  variable  in 
response  to  each  noise  sample.  In  the  past  we  have  inferred  this  value  based  on  binary  responses  in  a  simple 
yes/no  detection  task.  The  use  of  binary  responses  places  a  (nactical  limit  on  the  amount  of  information  that 
can  be  transmitted  on  each  trial  (W.  R.  Gamer  and  H.  W.  Hake,  Psychol.  Rev.  M,  446-459,  ISISl).  In 
addition,  the  approach  requires  a  number  of  additional  assumptions  that  have  not  been  tested.  Finally,  the 
variability  and  the  expected  value  of  the  estimate  of  the  decisfon  variable  are  not  independent. 

With  these  problems  in  mind,  we  have  been  developing  a  continuous  rating  procedure  based  on  the 
procedure  of  Watson,  Rilling  and  Bourbon  [J.  Acoust.  Soc.  Am.  X,  283-288,  1964].  The  procedure  is 
straightforward;  the  subjects’, task  is  to  use  a  mouse  to  position  a  cursor  along  the  bottom  of  a  CRT  screen  to 
indicate  his  confidenoe  that  a  signal  was  presented  on  a  particular  trial.  Positioning  the  cursor  to  the  right  of 
the  screen  indicates  confidence  that  the  signal  was  present,  while  positioning  the  cursor  to  the  left  indicaaes 
confidenoe  that  the  signal  was  not  presented.  Two  simple  experiments  were  conducted  to  evaluate  the 
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procedure.  In  the  first,  the  subjects’  task  was  to  determine  whether  a  particular  three  digit  number  presented 
on  the  CRT  screen  was  drawn  from  a  population  of  "noise  alone"  numbers  or  from  a  population  of  "signal 
plus  noise”  numbers.  In  the  second,  the  subjects'  task  is  to  detect  the  presence  of  a  tonal  signal  in  the 
presence  of  a  broadband  noise  masker.  Preliminary  analyses  of  the  visual  data  indicate  that  the  subjects  can, 
with  training,  produce  reliable  rating  judgments.  The  expected  value  of  these  judgments  appears  to  be  a 
simple  fun;.tion  of  the  stimulus  magnitude.  Ratings  obtained  for  rqteated  presentations  of  the  same  stimulus 
are  consistent.  Thus,  subjects’  ratings  can  be  used  to  reproduce  stimulus  distributions  (i.e.,  by  deriving 
receiver  operating  characteristics  or  fiequency  histograms).  Molecular  psychophysical  data  collected  with 
this  technique  will  be  compared  to  molecular  psychophysical  data  obtained  using  binary  responses. 
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The  threshold  of  a  short  interaurally  phase-invoted  probe  tone  (20  ms,  500  Hz,  S, )  was 
obtained  in  the  presence  of  a  7S0-ms  noise  masker  that  was  switched  after  373  ms  from 
interaurally  phase-inverted  ( N„ )  to  interaurally  in-phase  ( N„ ) .  As  the  delay  between  probe- 
tone  offset  and  noise  phase  transition  is  increased,  the  threshold  decays  from  the  N„S, 
threshold  (masking  level  difference  =  0  dB)  to  the  N„S,  threshold  (masking  level 
difference  =  15  dB).  The  decay  in  this  “binaural”  situation  is  substantially  slower  than  in  a 
comparable  “monaural”  situation,  where  the  interaural  phase  of  the  masker  is  held  constant 
(N, ),  but  the  level  of  the  masker  is  reduced  by  15  dB.  The  prolonged  decay  provides  evidence 
for  additional  binaural  sluggishness  associated  with  “binaural  forward  masking.”  In  a  second 
experiment,“binaural  backward  masking”  is  studied  by  time  reversing  the  maskers  described 
above.  Again,  the  situation  where  the  phase  is  switched  from  N„  to  N,  exhibits  a  slower 
transition  than  the  situation  with  constant  interaural  phase  (N„ )  and  a  15-dB  increase  in  the 
level  of  the  masker.  The  data  for  the  binaural  situations  are  compatible  with  the  results  of  a 
related  experiment,  previously  reported  by  Grantham  and  Wightman  [  J.  Acoust.  Soc.  Am.  65, 
1509-1517  ( 1979)  ]  and  are  well  fit  by  a  model  that  incorporates  a  double-sided  exponential 
temporal  integration  window. 

PACS  numbers:  43.66.Pn,  43.66.Nm,  43.66.Dc,  43.66.Mk  [WAY] 


INTRODUCTION 

Since  the  pioneering  work  of  Hirsh  (1948),  the  phe¬ 
nomena  of  binaural  masking  have  been  extensively  investi¬ 
gated  (see  Durlach  and  Colburn,  1978,  for  a  review).  The 
vast  majority  of  studies  have  focused  upon  stationary  condi¬ 
tions,  that  is,  situations  where  the  interaural  parameters  of 
the  signal  and  of  the  masker  are  fixed  within  a  given  trial. 
This  contrasts  with  most  real-world  listening  situations 
where  the  interaural  parameters  of  the  stimulus  fluctuate  as 
a  result  of  both  head  and  source  movement.  Obviously,  the 
temporal  properties  of  the  binaural  system  play  an  impor¬ 
tant  role  in  these  situations,  and  an  analysis  of  these  proper¬ 
ties  is  necessary  if  we  hope  to  understand  human  perception 
in  the  complex  stimulus  situations  of  everyday  life. 

Some  insights  into  the  temporal  limits  of  binaural  pro¬ 
cessing  have  been  obtained  by  direct  extension  of  techniques 
previously  applied  to  monaural  phenomena.  For  example, 
Blodgett  el  al.  (1958)  and  Green  (1966)  investigated  the 
detectability  of  a  tonal  signal  in  the  presence  of  a  continuous 
noise  masker  as  a  function  of  the  duration  of  the  signal  and 
found  that  the  ear  can  integrate  energy  linearly  up  to  a  maxi¬ 
mum  integration  time  of  100-200  ms  under  both  “monau¬ 
ral”  and  “binaural”  conditions.' 

In  contrast,  while  under  monaural  conditions  the  de¬ 
tectability  of  a  brief  probe  tone  is  almost  independent  of  the 
duration  of  a  longer  simultaneous  masker,  under  binaural 


conditions  detectability  increases  dramatically  with  masker 
duration  until  the  duration  of  the  portion  of  the  masker  that 
precedes  or  follows  the  probe  tone  exceeds  2(X)-600  ms.  Al¬ 
though  the  results  of  these  experiments  are  not  completely 
consistent,  it  has  been  assumed  that  the  binaural  system  re¬ 
quires  several  hundred  milliseconds  to  determine  and  re¬ 
spond  to  the  parameters  of  the  masker  (see,  for  example, 
McFadden,  1966;  Robinson  and  Trahiotis,  1972;  Trahiotis 
etal,  1972;  Zwickerand  Zwicker,  1984;  Yost,  1985;  Kohl- 
rausch,  1986;  Kollmeier,  1986). 

When  a  brief  probe  tone  is  presented  after  the  termi¬ 
nation  of  a  masker  (i.e.,  forward  masking),  the  decay  of 
masking  is  found  to  be  more  gradual  under  binaural  condi¬ 
tions  than  under  monaural  conditions.  Similar  results  are 
also  found  for  backward  masking.  That  is,  under  both  for¬ 
ward  and  backward  masking  conditions,  the  masking  level 
difference  (MLD,  i.e.,  the  difference  of  the  masked  thresh¬ 
old  in  monaural  versus  binaural  conditions)  decreases  as  the 
signal  is  moved  away  from  the  masker  in  time  (see  Snudl  et 
al.,  1972;  Wightman,  1973;  Berg  and  Yost,  1976).  Unfortu¬ 
nately,  there  are  several  possible  interpretations  of  these  re¬ 
sults  that  have  not  been  resolved  in  the  literature.  One  possi¬ 
bility  is  that  the  change  in  the  level  of  the  effective  masker  in 
bituural  channels  is  more  gradual  than  in  moiuura]  chan¬ 
nels.  Another  explanation  is  that  information  about  the  in¬ 
teraural  phase  relation  of  the  masker  exists  only  during  the 
time  the  masker  is  present  (Kohlrausch  and  Fassel,  1988). 
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A  third  explanation  notes  that  both  monaural  and  binaural 
thresholds  approach  the  same  value  (i.e.,  absolute  thresh¬ 
old  )  as  the  probe  tone  is  moved  away  from  the  masker.  Said 
differently,  the  magnitude  of  the  MLD  is  known  to  decrease 
with  decreases  in  monaural  masking.  Thus  we  would  expect 
the  MLD  to  decrease  because  monaural  of  masking  de¬ 
creases  as  the  probe  tone  is  moved  away  from  the  masker. 

A  problem  in  interpreting  the  data  from  all  these  ap¬ 
proaches  is  that  the  responses  of  both  the  monaural  and  the 
binaural  systems  have  to  be  considered.  That  is,  the  stimulus 
manipulations  influence  both  monaural  and  binaural  “chan¬ 
nels"  and  it  is  difficult  to  attribute  the  observed  temporal 
effects  to  a  particular  system.  Thus  another  approach  to 
studying  temporal  effects  in  binaural  unmasking  solves  these 
problems  by  using  noise  maskers  without  monaural  changes, 
but  with  a  temporally  varying  interaural  correlation. 
Grantham  and  Wightman  (1979)  used  a  short  interaurally 
pha.se-inverted  (S. )  probe  tone  in  the  presence  of  a  contin¬ 
uous  noise  masker  whose  interaural  correlation  varied  sinu¬ 
soidally  between  —  1.0  and  1.0.  When  the  rate  of  modula¬ 
tion  is  slow,  an  MLD  is  obtained  when  the  probe  tone  is 
presented  at  a  time  when  the  interaural  correlation  of  the 
masker  is  positive  and  no  MLD  is  found  when  the  masker 
correlation  is  near  —  1 .0.  As  the  modulation  frequency  in¬ 
creases,  the  difference  in  MLD  between  the  positive  and  neg¬ 
ative  interaural  correlation  decreases  rapidly  and  levels  off 
for  modulation  frequencies  above  4  Hz.  This  cutoff  frequen¬ 
cy  is  much  lower  than  those  obtained  in  comparable  monau¬ 
ral  experiments  with  amplitude-modulated  stimuli  (Vie- 
meister,  1977).  Grantham  and  Wightman  (1979)  termed 
this  insensitivity  to  rapidly  varying  binaural  cues  as  "binau¬ 
ral  sluggishness”  and  estimated  a  "binaural  minimum  inte¬ 
gration  time"  of  44-243  ms. 

The  evidence  of  binaural  sluggishness,  the  concept  of  a 
binaural  minimum  integration  time,  and  the  order  of  magni¬ 
tude  of  these  time  constants  agree  well  with  temporal  prop¬ 
erties  of  localization  (Blauert,  1968),  lateralization 
(Blauert,  1972;  Grantham  and  Wightman,  1978; 
Grantham,  1984),  and  binaural  correlation  discrimination 
(Pollack,  1978;  Grantham,  1982).  In  most  of  these  experi¬ 
ments,  however,  the  average  performance  of  the  binaural 
system  is  obtained  for  stimuli  with  periodically  changing  in¬ 
teraural  parameters,  and  the  transient  properties  of  the  bin¬ 
aural  system  in  response  to  a  rapid  change  in  parameters  are 
revealed  only  indirectly. 

In  the  experiments  described  here,  the  change  in  the 
detectability  of  a  brief  probe  tone  is  observed  in  response  to  a 
single  ( nonperiodic )  transition  in  the  “effective  level”  of  the 
masker.  This  transition  is  introduced  by  rapidly  changing 
the  interaural  phase  of  the  masker  or  by  changing  the  overall 
level  of  the  masker  in  both  ears.  The  conditions  employed 
are  analogous  to  forward  and  backward  masking  conditions. 
However,  while  performance  in  response  to  the  overall  level 
change  is  assumed  to  be  governed  by  the  monaural  system, 
the  interaural  phase  change  will  not  alter  the  level  of  the 
effective  masker  in  the  monaural  channels;  thus  perfor¬ 
mance  is  assumed  to  be  governed  by  the  binaural  system. 
Hence,  the  transient  properties  of  the  binaural  system 
should  be  revealed. 
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I.  METHOD 

A.  Subjects 

Two  female  and  two  male  college  students,  aged  be¬ 
tween  19  and  25  years,  were  paid  for  participation  in  the 
experiments.  All  had  clinically  normal  hearing  and  received 
at  least  20  h  of  training  before  data  collection  began. 

B.  Apparatus 

Signal  and  noise  stimuli  were  generated  on  a  Data  Gen¬ 
eral  NOVA  4x  computer.  They  were  produced  through  sep¬ 
arate  12-bit  digital-to-analog  converters  for  each  binaural 
channel  at  a  sampling  rate  of  10  000  samples/s  and  passed 
through  5-kHz  low-pass  elliptical  filters.  The  level  of  the 
signal  was  controlled  by  separate  programmable  Charybdis 
attenuators  for  each  subject.  The  signal  and  noise  waveforms 
were  then  added  with  an  analog  mixer  and  presented  to  the 
subjects  through  TDH-49  headphones  mounted  in  (X)IA 
cushions.  The  four  subjects  were  seated  in  individual  sound¬ 
attenuating  chambers  during  the  experiment.  Timing  and 
response  recording  were  controlled  by  the  computer. 

C.  Stimuli 

In  each  interval,  the  masker  was  randomly  sampled 
from  the  output  of  a  33-bit  shift  register  whose  repetition 
period  was  5.2  days  (Gilkey  ei  a!.,  1988)  and  was  switched 
on  for  750  ms  without  shaping  the  envelope.  Transitions  in 
interaural  phase  or  in  overall  level  were  generated  digitally 
by  the  appropriate  computation  of  each  noise  sample  for 
each  binaural  channel.  The  D/A-converted  and  low-pass- 
filtered  masker  was  bandpass  filtered  from  100  to  2000  Hz 
with  a  Krohn-Hite  3270  filter  and  presented  to  the  subjects 
at  a  reference  spectrum  level  of  40  dB  SPL/Hz.  The  signal 
was  a  S(X)-Hz  sinusoid  with  a  total  duration  of  20  ms,  includ¬ 
ing  5-ms  raised-cosine  onset  and  offset  ramps. 


D.  Conditions 

The  four  conditions  with  masker  transitions  are 
sketched  in  Fig.  1 .  In  the  "binaural”  conditions,  an  interaur¬ 
ally  phase-inverted  noise  masker  ( N, )  is  switched  to  inter¬ 
aurally  in-phase  [N.,,  Fig.  1(a)]  or  an  N„  masker  is 
switched  to  N  ,  [  Fig.  1  ( e )  j . '  Conversely,  in  the  “monaural" 
conditions,  the  level  of  an  N„  masker  is  lowered  by  15  dB 
(Fig.  1(b)]  or  increased  by  15  dB  (Fig.  1(d)]. 

Two  of  the  conditions  were  used  in  experiment  I:  the 
N,N„S,  condition  (Fig.  1  (a)  ],  a  375-  ms  segment  of  inter¬ 
aurally  phase  inverted  noise  followed  by  a  37  S-ms  segment  of 
interaurally  in-phase  noise,  and  the  N,(  —  15  dB)N,S„ 
condition  (Fig.  1  (b)],  a  75()-ms  interaurally  phase-inverted 
noise  masker,  which  was  attenuated  by  1 5  dB  375  ms  after  its 
onset.  The  masked  threshold  of  the  interaurally  phase-in- 
verted  probe  tone  (S, )  was  measured  as  a  function  of  the 
delay  time  between  the  transition  of  the  noise  and  the  signal 
qjfser.  In  this  experiment,  the  delay  time  varied  from  —  180 
to  -(-  320  ms. 
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FIG.  1 .  Schematic  diagram  of  the  masker  conitguraticKis  and  signals  used  in 
the  experiments.  Panel  (a)  depicts  the  N,N„S-  configuration,  where  the 
interaural  phase  relation  of  the  masking  noise  is  switched  from  180*  to  0* 
without  changingihemasker  level.  PaneUb)  depicts theN,  I  —  15dB)N^ 
S,  configuration,  where  the  interaural  phase  of  the  masker  is  held  constant, 
but  the  masker  level  is  lowered  by  l5dB,  The  two  lower  panels  (c)  and  (d) 
show  the  “backward  masking”  situations  obtained  by  lime  reversing  the 
"forward  masking”  situations  sketched  in  the  ,:pper  panel.  Panel  (c)  de¬ 
picts  iheK.N.S^  configuration,  and  panel  (d)  depicts  the  {  -  15dB)N„ 
NpS.:  configuration.  The  threshold  of  a  20-ms  SOO-Hz  S.  probe  lone  is  ob¬ 
tained  as  a  function  of  the  delay  time  between  tone  offset  (panels  (a)  and 
(b)  ]  or  onset  {panels  (cl  and  (d)  ]  and  the  switching  of  the  masker. 


In  experiment  11.  the  same  maskers  as  in  experiment  I 
were  used,  but  in  a  reversed  temporal  order:  the  N,.N^S.. 
condition  (Fig.  1  (c)  |,  a  .t75-ms  segment  of  inleraurally  in- 
phase  noise,  followed  by  a  375-ms  segment  of  inleraurally 
phase-inverted  noise,  and  the  (  —  15  dB)N,N;,S.  condition 
[Fig.  l(d)),a750-msN„  masker,  which  is  attenuated  by  15 
dB  during  the  first  375-ms  segment  and  is  not  attenuated 
during  the  second  375-ms  segment.  Again,  the  masked 
threshold  of  the  S.,  probe  tone  was  determined.  In  this  exper¬ 
iment.  however,  the  threshold  was  measured  as  a  function  of 
the  delay  time  between  signal  onset  and  the  transition  of  the 
masker.  The  delay  time  varied  between  —  300  ms  and 
-I-  100  ms. 

In  addition,  three  reference  conditions,  referred  to  as 
"nontransient  conditions,"  were  employed,  where  both  the 
level  of  the  masker  and  its  interaural  phase  were  held  con¬ 
stant  throughout  the  750-ms  duration  of  the  masker:  the 
N.S.  condition  (both  noise  and  signal  inleraurally  phase 
inverted),  the  N„S, condition  (noise  interaurally  in-phase 
and  signal  interaurally  phase  inverted)  and  the  (  —  15  dB) 
N_S-  condition  (noise  and  signal  interaurally  phase  invert¬ 
ed,  but  the  masker  attenuated  by  15  dB  such  that  the  spec¬ 
trum  level  equaled  25  dB/Hz).  The  signal  was  temporally 
centered  in  the  masker. 

We  assume  that  performance  in  the  binaural  conditions 
[Fig,  1(a)  and  ( c )  j  is  determined  by  the  change  of  activity 
in  a  binaural  processing  system  and  that  performance  in  the 
monaural  conditions  [Fig.  1  (b)  and  (d)  j  is  determined  by 
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the  change  of  activity  in  a  monaural  processing  system.  In 
both  cases,  the  masked  threshold  of  a  short  500-Hz  S,  probe 
lone  is  assumed  to  represent  the  masker  activity  in  the  re¬ 
spective  channel  during  the  presence  of  the  probe  tone.  Since 
the  condition  sketched  in  Fig.  1(a)  is  similar  to  the  monau¬ 
ral  forward  masking  condition  sketched  in  Fig.  1(b),  we  call 
this  condition  "binaural  forward  masking."  Similarly,  the 
condition  sketched  in  Fig.  1  (c )  is  called  "binaural  backward 
masking." 

E.  Trial  structure  and  measurement  procedure 

A  two-interval  forced-choice  (21FC)  procedure  was 
employed.  Each  trial  started  with  a  198-ms  warning  light, 
followed  by  a  198-ms  pause  and  the  two  750-0"'  observation 
intervals,  separated  by  a  250-ms  interstimulus  interval.  At 
the  end  of  the  second  observation  interval,  a  1 500-ms  inter¬ 
val  was  allowed  for  responding.  In  both  observation  inter¬ 
vals,  a  40-ms  marking  light  was  turned  on  20  ms  before  the 
time  when  the  signal  might  occur.  An  additional  198-ms 
feedback  light  was  provided  to  mark  the  interval  that  actual¬ 
ly  contained  the  probe  tone.  Trial-by-trial  feedback  was  pro¬ 
vided. 

An  adaptive  staircase  algorithm  was  used  to  control  the 
probe-tone  level,  following  the  recommendations  of  Koll- 
meier  et  al.  ( 1988).  At  the  beginning  of  a  track,  the  signal 
level  was  set  well  above  the  expected  threshold  and  lowered 
by  1.0  dB  after  each  correct  response.  As  soon  as  the  first 
incorrect  response  was  recorded,  the  signal  level  was  in¬ 
creased  by  1.0  dB  and  a  "one  up/three  down"  rule  was 
adopted  ( Levitt,  1971),  which  lowered  the  signal  level  by  1 .0 
dB  after  three  successive  correct  responses  and  increased  the 
level  by  1 ,0  dB  after  one  incorrect  response.  Each  measure¬ 
ment  block  consisted  of  60  trials,  and  all  four  subjects  were 
tested  simultaneously.  The  threshold  estimate  was  obtained 
as  the  average  of  the  levels  presented  on  all  trials  after  the 
third  reversal.  Each  data  point  represents  the  median  thresh¬ 
old  estimate  of  at  least  four  independent  tracks  for  each  sub¬ 
ject. 

II.  RESULTS 
A.  Experiment  I 

The  individual  results  of  the  forward  masking  experi¬ 
ment  [cf.  Fig,  1(a)  and  (b)  j  are  given  in  Fig.  2(a)-(d)  for 
all  four  subjects.  The  open  squares  represent  median  thresh¬ 
olds  for  the  N,N„S,-  condition  and  the  triangles  the  thresh¬ 
olds  for  the  N,(  —  15  dB)N„S.  condition.  The  abscissa  de¬ 
notes  the  time  delay  between  the  masker  transition  and  the 
offset  of  the  probe  tone.  The  probe-tone  level  al  threshold  is 
plotted  on  the  ordinate.  The  0-dB  point  corresponds  to  each 
individual  subject’s  nontransient  N,,S,,  threshold.  The  sig- 
nal-to-noise  ratios  associated  with  these  individual  thresh¬ 
olds  are  given  in  Table  1.  The  arrows  in  the  right-hand  comer 
depict  the  individual  nontransient  N„S,,  thresholds  (solid 
arrows)  and  {  —  15  dB)N,S,  thresholds  (dotted  arrows). 

In  the  N,N„S„  condition  [open  squares  in  Fig.  2(a)- 
(d)],  a  relatively  high  threshold  level  (exceeding  the  non- 
transient  N„S,  threshold  by  up  to  2  dB)  is  obtained  for 
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TABLE  I.  Signal-to-noi$e  ratios  {E/N„  indB)  for  the  nontransient  N.S,, 
N„S-  and  (  -  15  dB)  N,S,  thresholds  for  each  individual  subject.  The 
(  -  l5dB)N,S.  ihresholdisrelatedtotheunatlenuatedmaskerlevelused 
for  (he  N^S.  and  N,.S,  condition.  In  addition,  the  difference  AL  denotes 
I  he  difference  between  the  individual  nontransient  \.S„  threshold  and  the 
N .  S .  threshold  value  that  has  been  (it  to  the  data  in  Fig.  2. 


Subject 

N-S,  (dB) 

N„S.(dB)  1 

[  -  15dB)N,S.(dB) 

AL  (dB) 

1 

1 1  6 

4.2 

-  .V7 

-  1  58 

2 

It.b 

-  5.1 

.V6 

-0.96 

\ 

117 

1.7 

-  2.9 

-  1.67 

4 

10  8 

-  2.8 

-  4.0 

-  1,40 

dela\  limes  less  than  100  ms.  Conversely,  a  relatively  low 
threshold  level  (appro.ximating  the  nontransient  N„S- 
threshold)  is  obtained  for  delay  limes  greater  than  200  ins. 
This  difference  m  threshold  level  agrees  with  the  N..S, 
Mi. I),  that  is.  the  difference  between  the  nontransient  N  .S. 
and  N  S.  thresholds  averaged  across  the  four  subjects, 
which  anumnts  to  15.0  dB.  For  probe-tone  delays  between 

UK)  and  2*K)  ms.  a  continuous  transition  from  the  high 
threshold  at  negative  delays  t('  the  low  threshold  at  large 
positive  delavs  is  observed. 

In  {'rder  to  compare  the  rate  of  threshold  decay  in  re¬ 
sponse  to  a  binaural  phase  transition  with  the  rale  of  a  com¬ 
parable  threshold  decay  in  response  to  a  monaural  level 
change,  the  data  of  the  N.(  —  15  dB)N_S..-  condition  are 
also  shown  in  Fig,  2(a)-(d)  astriangles,  Similar  to  the  “bin¬ 
aural"  case,  a  relatively  high  threshold  level  (exceeding  the 
nontrunsient  N  .S.  threshold  by  up  to  2  dB)  is  obtained  at 
delav  limes  less  than  —  50  ms.  Conversely,  a  relatively  low 
threshold  level  (approximating  the  nontiansient  (  —  15 
dH)N.S.  threshold]  is  obtained  at  delay  times  greater  than 
1(X)  ms.  This  difference  in  threshold  level  was  selected  to 
match  the  average  N.,S„  MLD  as  closely  as  possible  (i.e.. 

1 5.0  dB )  Again,  there  is  a  continuous  transition  between  the 
high  threshold  at  negative  delays  and  the  low  threshold  at 
large  positive  delays.  The  transition  is  more  rapid  than  that 
for  the  N.  N..S.  condition. 

B.  Experiment  II 

The  individual  results  of  the  backward  masking  experi¬ 
ment  [cf.  Fig.  1(c)  and  (d)]  performed  by  time  reversing 
the  masker  sequences  of  experiment  I  are  given  in  Fig.  2(e)- 
(h).  The  open  squares  represent  median  thresholds  for  the 
N..N\S.  condition,  and  the  triangles  show  the  median 
thresholds  for  the  (  —  15  dB)N„N^  S_  condition.  In  this 
experiment,  however.ihe  abscissa  denotes  the  time  delay 
between  the  masker  transition  and  the  onset  of  the  probe 
tone.  The  probe-tone  level  at  threshold  is  plotted  on  the  ordi¬ 
nate.  As  in  Fig.  2(a)-(d).  the  individual  nontransient  N^S., 
reference  level  corresponds  to  0  dB  and  the  nontransient  N„ 
S.  and  (  —  l5dB)N^S^  threshold  levels  for  each  individual 
subject  are  given  as  a  solid  arrow  or  dotted  arrow,  respective¬ 
ly 

The  conditions  of  our  experiments  are  analogous  to  tem- 
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poral  masking  conditions,  in  that  periods  of  reduced  mask¬ 
ing  precede  or  follow  periods  of  greater  masking.  To  a  good 
approximation,  the  results  depicted  in  Fig.  2(e)-(h)  are  a 
mirror  image  of  those  in  Fig.  2(a)-(d).  However,  as  with 
traditional  temporal  masking  experiments  (cf.  Fasti,  1976; 
Small  et  aL,  1972),  the  transitions  are  more  rapid  for  the 
backward  masking  case  [Fig.  2(e)-(h)]  than  for  the  for¬ 
ward  masking  case  {Fig.  2(a)-(d) ).  In  addition,  the  differ¬ 
ences  in  the  rate  of  threshold  change  between  monaural  and 
binaural  cases  suggest  that  the  binaural  system  reacts  more 
“sluggishly"  to  temporally  varying  stimuli,  al.so  compatible 
with  the  previous  literature  (e.g..  Grantham  and  Wightman, 
1979).  The  solid  line.s  and  dotted  lines  in  Fig.  2  denote 
threshold  functions  fit  to  the  data  using  the  approach  de¬ 
scribed  in  Sec.  III. 

III.  INTEGRATION  MODEL 

To  quantify  the  rate  of  change  in  threshold  for  the  bin¬ 
aural  and  monaural  forward  and  backward  masking  tasks 
described  above,  we  fit  functions,  with  a  minimum  number 
of  free  parameters,  to  the  data.  These  functions  are  shown  in 
Fig.  2  as  solid  lines  for  the  “binaural"  conditions  and  as 
dotted  lines  for  the  "monaural"  conditions.  In  this  section, 
the  assumptions  and  the  model  used  to  generate  these  func¬ 
tions  are  specified.  and  the  obtained  time  constants  arc  com¬ 
pared  to  the  "minimum  binaural  integration  lime"  discussed 
by  Grantham  and  Wightman  ( 1979). 

Our  model  incorporates  elements  of  the  "equalization- 
cancellation  (EC)  theory"  (Durlach,  1972).  as  well  as  some 
of  the  assumptions  described  by  Grantham  and  Wightman. 
to  obtain  a  relationship  between  the  amount  of  binaural 
masking  and  (he  delay  //me  /.  The  threshold  of  /he  probe 
tone  is  determined  by  the  instantaneous  masking  level  L(i) 
at  the  output  of  a  binaural  noise-reduction  priKessor  (eg., 
the  “cancellation"  mechanism  in  the  EC  theory),  operating 
in  the  critical  band  around  the  probe-tone  frequency.  The 
instantaneous  masking  level  Lit)  is  determined  by  the  aver¬ 
age  interaura!  cross-correlation  coefficient  r(/).  This  rela¬ 
tion  is  taken  from  the  EC  theory: 

Lit)  -  10Iog{(A'  +  1)/[A'  -  r(r)]}.  (!) 

where  L^  is  the  monaural  masked  threshold  level.  K  repre¬ 
sents  the  internal  noise  and  r(/)  is  the  lime-varying  average 
interaural  cross-correlation  coefficient.  Here.  /•(/)  is  ob¬ 
tained  by  a  weighted  integration  of  the  instantaneous  inter- 
aural  cross-correlation/jf  /)  during  the  preceding  instants  of 
time  (  —  oc  </’</)  or.  more  generally,  by  integrating  over 
an  infinite  range  of  preceding  and  successive  instants  of  time: 

'■(0=J  wU  -  (2) 

Here.  «'(/  —  /')  denotes  one  of  several  temporal  window 
functions  described  in  the  following  paragraph.  Although 
the  integration  window  extends  over  an  infinite  range  in  the 
future,  this  should  not  imply  that  the  subject's  performance 
violates  the  causality  principle  by  extracting  information  yet 
to  be  presented  in  the  future.  The  contradiction  can  be  re¬ 
solved  by  setting  the  window  to  zero  at  a  sufficiently  large 
positive  lime  (which  can  be  done  without  significantly  alter¬ 
ing  the  results)  and  by  allowing  the  subject  to  introduce  an 
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FIG.  2.  Panels  (a)-(d)  show  median  estimates  and  interquartile  ranges  of  the  masked  threshold  for  the  N,N„S.  configuration  (squares)  and  the  N, 

(  -  15dB)N,  S-  configuration  ( triangles)  as  a  function  of  the  delay  time  between  offset  ofihe  probe  tone  and  transition  in  the  masker.  Panels  (e)*{h)  show 
median  estimates  and  interquartile  ranges  for  the  N.,N.S,  configuration  (squares)  and  the  (  -  l5dB)N.  N.  $.  configuration  (triangles)  as  a  function  of 
the  delay  time  between  onset  of  the  probe  tone  and  transition  in  the  masker  The  individual  results  for  subjects  1-4  are  given  in  panels  (a)-(d)  and  (e)-{h). 
respectively.  TheO-dB  point  on  the  ordinate  denotes  the  individual  nontransient  N.S. -thresholds,  the  solid  arrows  at  the  ordinate  denote  the  nontransient 
N..6'- -thresholds,  and  the  dotted  arrows  denote  the  nontransient  (  -  I5dB)N.,$. -thresholds.  The  solid  lines  represent  mtxJel  functions  that  have  been  fitted 
to  the  "binaurar*  data  on  the  basis  of  a  double-sided  exponential  temporal  window.  Thedotted  lines  represent  model  functions  fit  to  the  “mnnaural''  data  with 
a  similar  algorithm  (sec  text  for  details). 


arbitrary  delay  between  the  sensory  input  and  his  judgment 
that  is  greater  than  or  equal  to  this  period  of  time 

Grantham  and  Wightman  (1979)  used  a  single-sided 
exponential  temporal  window  and  defined  its  time  constant 
as  the  "binaural  minimum  integration  time."  In  many  psy¬ 
chophysical  experiments,  however,  more  complex  temporal 
windows  are  required  to  account  for  the  data  (e.g.,  Moore  er 
al..  1988).  On  the  other  hand,  if  loo  many  free  parameters 
are  needed  to  describe  a  particular  temporal  window,  their 


fitted  values  will  have  little  explanatory  utility.  For  this  rea- 
s»m.  the  discussion  here  will  be  restricted  to  the  following 
temporal  window  functions  (each  of  which  requires  only 
two  parameters): 

( I )  rectangular  window 
luCt)  =  (r, -b  r,)  '  for 


=  0  elsewhere; 
(2)  triangular  window 


(3) 
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w(t)  =  2(t,  +  /)/(t  J  +  T.Tj)  for  -  r|<r<0 

=  2(r2  — +  TiTj)  for-0<f<T2 

=  0  elsewhere; 

( 3 )  Gauss  window 

w{t)=-  * 


(r,  +  T2)\!^ 

(4)  double-sided  exponential  window 

for  KO 


(r,  -(-  T;) 

_  exp(  —  //r,) 
(r,  r,) 


for  t  >  0; 


( 5 )  rounded  exponential  window 


U’(()  = 


1  -  0/r,) 

2(r,  -f  r,) 

1  4-  (f/r,) 


for  kO 


for  /  ->  0. 


(4) 


(6) 


(7) 


2(r,  +  r,) 

Note  that  the  integral  of  these  window  functions  is  normal¬ 
ized  to  unity.  The  "equivalent  rectangular  duration"  is  the 
inverse  of  the  maximum  value  of  the  respective  temporal 
window.' 

The  idealized  time-dependent  interaural  cross-correla¬ 
tion  pU)  is  derived  from  the  idealized  instantaneous  inter¬ 
aural  phase  of  the  noise  masker.  For  the  binaural  "forward 
masking"  experiment  N,,N„S,  [Fig.  2(a)-(d)j,  it  is  given 
by 

p{l)  =  -  1  for  l<0 


=  4-1  for  l>0.  (8) 

The  function  p(t){oT  the  binaural  "backward  masking"  ex¬ 
periment  N„N_S.  [Fig.  2(e)-(h)  ]  is  obtained  by  time  re¬ 
versing  Eq.  (8). 

By  inserting  a  window  function  selected  from  Eqs.(3)- 
(7)  and  Eq.  (8)  into  Eq.  (2),  Z.(/)  can  be  calculated  from 
Eq.  ( I )  for  both  the  binaural  forward  and  backward  mask¬ 
ing  experiment.  We  do,  however,  make  an  additional  as¬ 
sumption.  Presumably,  the  subject  can  manipulate  the  tem¬ 


poral  center  of  his  window  to  increase  the  effective 
signal-to-noise  ratio  (“off-time  listening”).  To  allow  for  this 
effect,  we  assume  that  the  subject  centers  his  window  at  the 
onset  of  the  signal  under  the  backward  masking  condition, 
and  at  the  offset  of  the  signal  under  the  forward  masking 
condition.  The  parameters  that  have  to  be  fit  to  the  data  of 
each  subject  are  the  time  constants  r,  and  r,  and  the  values 
of  L„  and  K.  In  principle,  f.„  is  given  by  the  nontransient 
N,S„  reference  threshold  and  K  is  determined  by  the  non¬ 
transient  N„S,  MLD  for  each  individual  subject.  In  the 
transient  conditions  with  a  change  in  the  interaural  masker 
phase,  however,  the  threshold  levels  for  very  large  positive 
and  large  negative  values  of  t  do  not  approach  these  non  tran¬ 
sient  thresholds,  but  are  up  to  2.0  dB  higher.  '  Since  this 
increase  in  threshold  level  is  roughly  the  same  before  and 
after  the  transition  of  the  masker,  only  the  value  of  L^,  has  to 
be  fit  to  the  data,  while  the  value  of  K  is  determined  by  the 
individual  nontransient  N„S.,  MLD.  Further,  the  three  free 
parameters  r, .  and  r,  are  fit  to  the  data  of  the  binaural 
forward  masking  experiment  (N,,N„S, )  and  the  backward 
masking  experiment  ( N„N.S, )  simultaneously.  Thus  a  sin¬ 
gle  least-squares  fit  between  the  data  and  the  two  model 
functions  is  found  using  the  Simplex  method  ( Nedler  and 
Mead,  1465).  The  time  constants  r,  and  r.  and  the  normal¬ 
ized  deviation  measure  obtained  for  each  integration 
window  are  given  in  Table  11  for  each  individual  subject.' 

Figure  3  shows  threshold  functions  for  the  five  choices 
of  the  binaural  integration  window  [Eqs.  (3)-(7) )  fit  to  the 
"binaural"  data  of  subject  1 .  The  data  points  are  taken  from 
Fig  2(a)  (condition  N.N.  S,,  triangles)  and  Fig.  2(e) 
(condition  N„N.S,.  inverted  triangles).  The  ordinate  and 
the  abscissa  are  the  same  as  in  Fig.  2. 

Obviously,  the  dotted  curves  in  Fig,  3.  which  were  com¬ 
puted  with  a  rectangular  window,  do  not  yield  an  adequate 
description  of  the  experimental  data.  The  long-dashed 
curves  in  Fig.  3  are  based  on  a  triangular  window  |  Eq.  (4)  ]. 
As  with  the  rectangular  window,  these  curves  do  not  give  an 
adequate  description  of  the  smooth  transition  in  threshold 
for  delays  smaller  than  —  50  ms  or  greater  than  50  ms.  For 
this  reason,  integration  windows  such  as  the  Gaussian,  the 
double-sided  exponential,  and  the  rounded  exponential  win- 


TABLE  It.  Time  constants  r,  and  r,  (in  ms),  tit  to  the  "binaural"  forward  and  backward  masked  threshold  data  from  Fig.  2,  for  five  different  shapes  of  the  in¬ 
tegration  window  and  each  individual  subject.  The  normalized  deviation  measure  (sec  foot  note  4)  isalso  included  for  the  data  obtained  with  each  subject. 


Subject  1 

Subject  2 

Subject  3 

Subject  4 

Rectangular  window 

r,/r,  (ms) 

53.1/56.4 

52.8/53.9 

8.3.2/92.8 

33.2/41.4 

■  0.894 

0.864 

0.895 

0.922 

Triangular  window 

r,/rj  (ms) 

77.5/102 

61.6/64.6 

137/129 

46.1/49.6 

^nl 

0.925 

0.909 

0.933 

0.946 

Gaussian  window 

r,/rs  (ms) 

-  5.5/37.2 

-  2.3/29.7 

0.5/53.1 

-  1.8/22.4 

0.941 

0.928 

0.941 

0.957 

Exponential  window 

r,/rj  (ms) 

22.2/27.6 

I9.8/2I.0 

42.7/40.5 

I6.1/17.I 

^nl 

0.963 

0.966 

0.956 

0.968 

Rounded  exponential  window 

r/fj  (ms) 

15.0/18.9 

13.6/14.6 

28.8/27.5 

10.6/11.4 

0.958 

0.952 

0.952 

0.965 
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delay  rime  (ms) 


FIG  3.  Comparison  of  different  functions  fitted  to  (he  binaural  data  of  sub¬ 
ject  I  from  Fig.  2.  The  upright  triangles  denote  the  N  ^  H.S,  condition,  and 
the  inverted  triangles  denote  theN„N^S,  condition.  Dotted  line:  rectangu¬ 
lar  window;  long-dashed  line:  triangular  window;  dash-dotted  line;  Gaus¬ 
sian  window:  solid  line:  double-sided  exponential  window;  short-dashed 
tine:  rounded  exponential  window. 


dow.  yield  more  appropriate  model  functions.  The  problem 
with  the  Gaussian  window  is  that  it  produces  the  same  slopes 
for  forward  and  backward  masking  (dashed-dotted  lines  in 
Fig.  3),  although  the  time  course  of  the  data  clearly  shows 
asymmetries  between  forward  and  backward  masking.  The 
double-sided  exponential  window  (Eq.  (6)i  provides  the 
best  lit  for  the  binaural  data  presented  here  (solid  line  in 
Fig.3)  since  the  normalized  "goodness  of  fit"  measure  B„, 
(cf.  Table  II )  takes  its  maximum  value  for  all  four  subjects. 
The  functions  fit  to  each  individual  subject's  data  with  this 
type  of  a  temporal  window  are  included  as  solid  lines  in 
Fig.2.  The  second  best  fit  is  obtained  with  the  rounded  expo¬ 
nential  window  [Eq.  (7) ).  The  model  functions  fitted  with 
this  window  ( short -dashed  lines  in  Fig.  3 )  deviate  only  mar¬ 
ginally  from  those  obtained  with  the  double-sided  exponen¬ 
tial  window. 

Although  the  single-sided  exponential  temporal  win¬ 
dow  was  not  specifically  examined  here  (because  its  predic¬ 
tions  for  the  backward  masking  case  differ  substantially 
from  the  data),  the  "equivalent  rectangular  durations"  for 
all  of  our  subjects  and  for  all  windows  considered  agree  well 
with  Grantham  and  Wightman’s  "binaural  minimum  inte¬ 
gration  time”  estimates  (between  44  and  243  ms). 

Although  Grantham  and  Wightman's  experiments  are 
rather  different  from  ours  and  the  range  of  their  time  con¬ 
stants  exceeds  the  range  of  the  time  constants  reported  here, 
the  relation  between  the  two  methods  should  be  considered 
in  greater  detail  to  assure  that  a  substantial  disagreement 
does  not  exist  between  both  sets  of  data.  Therefore,  we  at¬ 
tempt  to  predict  their  data  with  our  time  constants  for  the 
rectangular  and  the  double-sided  exponential  temporal  win¬ 
dow.  In  addition,  the  ability  of  a  double-  and  a  single-sided 
exponential  temporal  window  to  predict  their  data  is  exam¬ 
ined.  In  the  Appendix,  the  maximum  MLD  is  calculated  for 
Grantham  and  Wightman's  maskers  with  sinusoidally  vary¬ 
ing  interaural  correlation  as  a  function  of  the  modulation 
frequency.  The  predictions  for  the  rectangular  and  the  dou- 
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modulation  frequency  IHz) 


FIG.  4.  Predictions  for  the  data  of  Grantham  and  Wightman  { 1479 )  based 
on  a  rectangular  temporal  window  (dotted  corver  t,  +  t,  =175  ms),  a 
single-sided  exponential  temporal  window  (dash-dotted  curve;  r,  =  0,  r, 
=  181  ms),  and  a  double-sided  exponential  lempoml  window  (solid  curve: 
r,  =  42.7  ms.  r.  =  40.5  ms,  dashed  curve:  r,  =  =  1 14  ms).  The  solid 

and  dotted  curves  are  ba.sed  on  the  time  constants  estimated  for  sub)ect  5 
from  the  data  given  in  Fig.  2(c)  and  (g).  The  dash.dottcd  and  dashed 
curves  were  obtained  by  a  least -squares  5t  to  the  data  provided  in  this  figure. 
The  abscissa  denotes  Ihe  moduJalion  frequency  ofa  noise  masker  with  sinu¬ 
soidally  lime-varying  inleraural  correlation,  and  Ihe  ordinate  denotes  the 
masked  threshold  of  an  S.  probe  tone  presented  simultaneously  with  the 
occurrence  of  a  maximally  positive  interaural  correlation  of  the  masker. 
The  (MB  point  denotes  the  stationary  N.S.  threshold,  and  the  arrow  on 
Ihe  left-hand  side  denotes  the  stationary  N.,S.  threshold.  Squares  denote 
average  threshold  values  of  the  three  subjects  of  Grantham  and  Wightman 
(1979), 


ble-sided  exponential  temporal  window  are  plotted  in  Fig.  4 
using  Ihe  decay  constants  obtained  for  our  subject  3,  whose 
fitted  decay  constants  for  N,N„S,  and  N„N,S.  were  the 
largest  (i.e..  closest  to  the  middle  of  the  range  reported  by 
Grantham  and  Wightman ) .  The  dotted  curve  denotes  theo¬ 
retical  values  for  a  rectangular  window  (r,  -I-  Tj  =  175  ms) 
and  the  solid  line  denotes  the  curve  for  a  double-sided  expo¬ 
nential  window  ( T,  =  42.7  ms  and  Tj  =  40.5  ms) .  The  open 
squares  denote  Ihe  mean  MLD  values  for  500  Hz  for  the 
three  subjects  of  Grantham  and  Wightman  ( 1979),  as  sup¬ 
plied  by  one  of  the  authors.  The  normalized  deviation  from 
these  data  for  the  curves  predicted  for  our  subject  3  is  B„, 
=  0.840  ( rectangular  window,  dotted  line  in  Fig.  4)  and  B„, 
=  —  0.363  ( double-sided  exponential  window,  solid  line  in 
Fig.  4).  Both  curves  are  below  the  data  supplied  by 
Grantham  and  Wightman  for  all  modulation  frequencies, 
indicating  that  even  the  largest  time  constants  found  in  our 
study  are  below  the  values  required  to  predict  Grantham  and 
Wightman's  average  data.  However,  since  the  deviation 
between  the  dotted  curve  and  Grantham  and  Wightman's 
data  is  small,  the  first-order  prediction  from  our  data  based 
on  a  rectangular  temporal  window  is  in  approximate  agree¬ 
ment  with  Grantham  and  Wightman's  findings. 

The  dashed-dotted  curve  and  the  dashed  curve  were  fit¬ 
ted  to  the  average  data  by  using  a  single-sided  exponential 
window  (r,  =  0,  Tj  =  181  ms,  =  0.841 )  and  a  double¬ 
sided  exponential  window  (r,  =r2  =  II4  ms, 
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TABLE  111.  Same  as  Table  11  for  the  “nionaural'’  forward  and  backward  masked  thresholds  fn»i  Fig.  2. 


Subject  1 

Subject  2 

Subject  3 

Sui:^4 

RecanUngular  window 

r,/r2  (ms) 

13.3/15.8 

20.8/17.0 

32.0/26.4 

12.6/12.5 

^nl 

0.943 

0.888 

0.910 

0.961 

Triangular  window 

T,/Ty  (ms) 

19.9/21.4 

34.9/28.7 

44.3/37.0 

18.4/19.6 

0.950 

0.891 

0.919 

0.964 

Gaussian  window 

r,/r2  (ms) 

-0.1/8.7 

2.6/9.6 

3.4/13.0 

-  0.02/7.8 

0.954 

0.909 

0.928 

0.964 

Exponential  window 

Tj/Tj  (ms) 

6.5/6.4 

1 1.4/8.7 

14.7/11.3 

5.9/6.0 

■ffnl 

0.947 

0.900 

0.911 

0.950 

Rounded  exponential  window 

Tj/Tj  (ms) 

44/4.5 

7.5/59 

9.8/7.8 

4.0/4.2 

0.905 

0.905 

0.918 

0.957 

=  0.908),  respectively.  As  can  be  seen,  Grantham  and 
Wightman’s  average  data  are  about  equally  well  fit  using  a 
single-  and  a  double-sided  exponential  window. 

In  terms  of  a  systems  analysis  approach  applied  to  the 
binaural  unmasking  mechanism  (such  as  the  EC  device), 
our  data  describe  something  analogous  to  the  “envelope  step 
response,”  while  the  data  of  Grantham  and  Wightman 
(1979)  and  Grantham  (1982)  describe  something  analo¬ 
gous  to  the  “modulation  transfer  function”  (i.e.,  the  modu¬ 
lation  transfer  function  is  defined  as  the  amount  of  modula¬ 
tion  detectable  at  the  output  of  the  system  in  response  to  an 
envelope-modulated  signal,  while  Grantham  and  Wightman 
measured  the  response  to  a  signal  whose  interaural  correla¬ 
tion  was  modulated).  If  the  system  can  be  approximated  as 
linear  and  time  invariant,  both  descriptions  would  be  equiva¬ 
lent  and  the  formulas  given  in  the  Appendix  would  give  the 
appropriate  relation  between  our  approach  and  Grantham 
and  Wightman's  approach:  An  exponentially  rising  and  de¬ 
caying  step  response  would  correspond  to  an  exponential 
temporal  window  and  a  modulation  transfer  function  given 
by  Eq.  (A5),  which  decreases  with  ( \/f„ )’.  On  the  other 
hand,  a  modulation  transfer  function  decreasing  with 
( 1  //„. )  would  correspond  to  either  a  rectangular  or  a  single¬ 
sided  exponential  temporal  window.  Hence,  the  slope  of  the 
modulation  transfer  function  for  large  values  of  f„  could 
differentiate  between  different  types  of  temporal  windows. 
Unfortunately,  this  slope  cannot  be  estimated  precisely  from 
experimental  data  because  the  MLD  decreases  rapidly  as  the 
modulation  frequency  increases.  Therefore,  no  significant 
difference  is  observed  for  the  single-  and  double-sided  expo¬ 
nential  temporal  windows  in  predicting  Grantham  and 
Wightman's  data  (cf.  Fig.  4).  However,  large  differences 
between  different  types  of  temporal  windows  are  observed 
for  our  data,  which  therefore  appear  to  be  more  appropriate 
for  describing  the  interaural  correlation  averaging  process. 

One  significant  difference  between  our  data  and 
Grantham  and  Wightman's  data  is  the  fact  that  the  time 
constants  of  the  double-sided  exponential  window  required 
to  fit  their  average  data  are  higher  than  those  obtained  from 
our  data  (cf.  Table  II).  This  fact  also  holds  for  the  time 
constants  of  the  rectangular  window.  However,  since  the 
four  modulation  frequencies  employed  by  Grantham  and 
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Wightman  might  not  provide  a  sufficient  database  to  dis¬ 
criminate  among  the  different  two-parameter  windows,  only 
a  qualitative  agreement  with  our  data  should  be  expected. 
This  agreement  is  established  by  the  fact  that  our  range  of 
rectangular  window  widths,  and  equivalent  rectangular  du¬ 
rations  of  the  double-sided  exponential  windows  ( with  the 
exception  of  subject  4),  is  completely  contained  within  their 
range  of  time  constants  for  the  single-sided  exponential  win¬ 
dow  (44-243  ms). 

In  order  to  compare  the  time  constants  obtained  for  the 
binaural  conditions  N,,N„S,,  and  N,.N.,S,,  with  those  for 
the  monaural  conditions  N.(  —  15  dBIN.S^  and  (  -  15 
dB)N.N.S.,,  the  same  algorithm  was  used  to  fit  theoretical 
curves  to  the  data.  The  dotted  lines  in  Fig.  2  denote  these 
functions  generated  with  a  double-sided  exponential  tempo¬ 
ral  window.  However,  since  the  interaural  correlation  of  the 
masker  did  not  change  in  these  conditions,  we  rewrite  Eq. 
(1)  as 

t(/)  =  i  ,,  -f  lOlogH  -!-(>(/)).  (9) 

where  L  |^  jb  =  L„  —  10  log(A  1  )/(K  -  1 )  denotes 
the  nontransient  (  —  15dB)N,S,  threshold,  and  f>(r)  is  in¬ 
terpreted  as  the  weighted  average  of  that  part  of  the  masker 
power  that  exceeds  the  —  15-dB  level  of  the  masker.'  For 
this  reason,  Eq.  (9)  simply  describes  a  power-law  additivity 
of  monaural  thresholds:  In  the  nontransient  N„S„  case 
|*(f)  =  2/(Ar  -  1 )  ]  or  (  -  15  dB)N,S,  case  [*(r)  =  0], 
the  respective  threshold  values  L„mL  ,,  btc  obtained, 
whereas  the  transient  properties  of  the  threshold  are  deter¬ 
mined  by  a  moving  weighted  average  of  the  masker  power. 
Similar  approaches  have  been  described  by  Robinson 
( 1 974)  and  Moore  ei  al.  ( 1 988 ) .  The  time  constants  derived 
for  different  types  of  temporal  windows  are  given  in  Table 
III.  They  are  substantially  smaller  than  those  obtained  for 
the  binaural  data  with  the  same  temporal  window  function, 
and  tend  to  be  higher  than  those  obtained  by  Moore  el  al. 
( 1988).  In  addition,  the  Gaussian  temporal  window  appears 
to  predict  our  monaural  data  better  than  the  exponential 
window  (which  provides  the  best  fit  for  our  binaural  data) 
and  a  rounded  exponential  temporal  window  (which  pro¬ 
vides  the  best  fit  for  the  data  of  Moore  el  al.,  1 988 ) .  Since  the 
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dynamic  range  of  the  data  presented  here  is  small  in  com¬ 
parison  to  the  measurement  accuracy  of  about  I  dB,  only 
properties  of  the  central  part  of  the  temporal  window  can  be 
estimated,  whereas  its  shape  at  times  remote  from  the  center 
and  its  total  dynamic  range  cannot  be  predicted  accurately. 
Therefore,  a  detailed  discussion  of  the  shape  of  the  monaural 
temporal  weighting  window  and  the  limited  explanatory  val¬ 
ue  of  such  temporal  windows  (Piischel,  1988)  is  beyond  the 
scope  of  this  paper. 

IV.  DISCUSSION 

From  the  data  presented  here,  it  is  evident  that  the  bin¬ 
aural  system  reacts  more  sluggishly  to  a  change  in  the  "effec¬ 
tive  level"  of  the  masker  than  does  the  monaural  system. 
Similar  findings  of  binaural  sluggishness  have  been  reported 
by  several  authors  (Blauert,  1968.  1972;  Grantham  and 
Wightman,  1978.  1979;  Zurek  and  Durlach,  1987; 
Grantham,  1982,  1984)  using  different  psychoacouslic  ex¬ 
periments  and  different  binaural  cues.  There  are  at  least  two 
problems  with  comparing  their  time  constants  to  those  re¬ 
ported  here. 

First,  the  definition  of  a  "lime  constant"  varies  from 
experiment  to  experiment.  As  shown,  when  appropriate  cor¬ 
rections  are  made,  our  time  constants  agree  to  a  first  approx¬ 
imation  with  those  of  Grantham  and  Wightman  ( 1979). 
Other  authors  have  derived  time  constants  between  200  and 
500  ms  from  the  dependence  of  the  MLD  on  masker  dura¬ 
tion  in  simul.aneous  masking  (Yost,  1985  (see  footnote  6); 
Kohlrausch.  1986 1  and  nonsimultaneous  masking  (Small  ei 
a!.,  1972;  Lakey,  1976).  Unfortunately,  all  underlying  mod¬ 
el  assumptions  were  not  specified  exactly  in  these  studies,  so 
that  a  quantitative  analysis  that  relates  our  comparatively 
low  time  constants  to  their  findings  is  impossible. 

Second,  since  different  experiments  test  different  binau¬ 
ral  abilities,  i.e..  localization  (Blauert.  1968),  lateralization 
(Blauert,  1972;  Grantham  and  Wightman,  1978;  Pollack. 
1978;  Grantham.  1982),  and  detection  (Grantham  and 
Wightman,  1979;  Kollmeier,  1986;  Kohlrausch,  1986),  it  is 
possible  that  many  of  these  time  constants  estimate  different 
properties  of  the  binaural  system,  even  though  most  binaural 
time  constants  are  of  the  same  order  of  magnitude. 

Although  the  variability  in  the  data  within  studies  and 
between  studies  that  use  similar  techniques  might  be  used  to 
argue  against  this  second  point,  the  possibility  of  multiple 
different  sources  of  binaural  sluggishness  has  some  appeal. 
In  our  experiment,  the  slow  decay  of  binaural  masking  might 
simply  be  attributed  to  forward  masking  in  a  comparatively 
sluggish  binaural  transmission  channel  (e.g.,  at  the  output  of 
the  "C”  mechanism  of  Durlach,  1972),  while  the  masking 
level  decays  more  slowly  than  in  a  monaural  channel  ( e.g.,  at 
either  of  the  monaural  inputs  to  the  "decision  unit"  of  Dur¬ 
lach,  1972).  Since  detection  would  occur  whenever  the  level 
of  the  probe  tone  exceeds  the  temporary  masking  level  in  any 
of  these  channels,  the  performance  in  our  "monaural"  con¬ 
ditions  would  be  determined  by  the  fast  decay  in  the  monau¬ 
ral  channels,  whereas  the  slow  decay  of  the  binaural  channel 
would  only  show  up  in  our  “binaural"  conditions.  For  the 
present  discussion,  we  denote  the  slow  decay  in  the  binaural 
channel  as  “binaural  channel  sluggishness."  When  predict¬ 


ing  dynamic  localization  and  interaural  correlation  dis¬ 
crimination  tasks  with  the  same  kind  of  binaural  model,  a 
more  sophisticated  detection  strategy  has  to  be  assumed. 
With  Durlach’s  EC  model,  for  example,  the  outputs  of  sever¬ 
al  hypothetical  “EC"  mechanisms  tuned  to  different  inter¬ 
aural  delays  and  intensity  differences  have  to  be  monitored 
over  time,  and  discrimination  would  occur  if  any  change  in 
the  distribution  of  output  levels  is  observed.  The  delay  asso¬ 
ciated  with  establishing  and  comparing  these  distributions 
from  the  multiple  sluggish  “binaural  channels"  over  time 
might  result  in  an  additional  binaural  sluggishness  that  we 
denote  as  "binaural  analyzer  sluggishness."  For  example, 
binaural  analyzer  sluggishness  might  contribute  to  the  time 
constants  of  about  200  ms  reported  by  Blauert  ( 1968, 1972), 
who  studied  the  maximum  delectable  frequency  of  switch¬ 
ing  between  different  sound  source  locations.' 

In  most  experiments  on  dynamic  properties  of  the  bin¬ 
aural  system,  both  hypothetical  sources  of  binaural  sluggish¬ 
ness  might  influence  the  obtained  time  constants.  In  our  ex¬ 
periment.  however,  the  comparatively  small  time  constants 
could  suggest  a  negligible  role  of  binaural  analyzer  sluggish¬ 
ness:  The  binaural  system  might  attempt  to  detect  the  S . 
probe  tone  by  adopting  a  fixed  binaural  processing  strategy 
(such  as  subtracting  both  monaural  stimuli  without  inter¬ 
aural  delay  in  the  C  mechanism  of  Durlach.  1972.  or  inspect¬ 
ing  only  one  optimum  interaural  lime  delay  "place"  of  Jef- 
fress.  1948 )  and  therefore  monitoring  the  output  of  only  one 
"sluggish"  binaural  channel.  In  this  case,  the  detection  pro¬ 
cess  could  be  similar  to  the  monaural  case  in  that  the  binau¬ 
ral  analyzer  would  not  base  to  compare  activity  across  chan¬ 
nels  of  the  binaural  display  or  within  channels  as  a  function 
of  time.  The  threshold  is  determined  by  the  effective  level  of 
the  masker  at  the  output  of  the  binaural  channel  and  thus  is 
influenced  by  binaural  channel  sluggishness,  but  not  by  btn- 
aural  analyzer  sluggishness.  Within  this  view,  the  measure- 
menls  reported  here  would  provide  a  lower  limit  on  esti¬ 
mates  of  binaural  sluggishness. 
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APPENDIX 

The  stimulus  used  by  Grantham  and  Wightman  ( 1979 ) 
had  a  time-dependent  interaural  cross-correlation  coeffi¬ 
cient  that  varied  sinusoidally  at  the  modulation  frequency 
u)  =  lnf„. 

p(f)  =  sin  lof.  (Al) 

The  average  interaural  cross-correlation  coefficient  r(r)  is 


L 


obtained  by  substituting  Eq.  (At )  and  the  appropriate  win¬ 
dow  function  (Eqs.  (3)-(7))  into  Eq.  (2),  such  that 

u)(/ —  / ')sin (Ut '<//'  =  Im[e  tVUa)]. 

(A2) 

In  this  equation,  Im  denotes  the  imaginary  part  of  a 
complex  number, 7  is  v  —  1 ,  and  U'ioi)  is  the  Fourier  trans¬ 
form  of  the  window  w(t)  The  right-hand  side  of  Eq.  (A2) 
follows  from  the  definition  of  the  Fourier  transform  of  a 
continuous  function  u'(  /  ’ )  and  its  transformation  properties 
if  the  function  is  time  reversed  and  shifted  by  an  amount  t. 
For  a  fixed  modulation  frequency /„, .  the  largest  MLD  with 
a  short  S_  probe  tone  is  obtained  for  the  largest  possible 
pessitive  value  of  r{i),  which  is  simply  the  modulus  of  IFt*)): 

ttt**  [r,,  (t)  ]  =  I  .  (A.1) 

For  a  rectangular  temporal  window  |  Eq.  ( 3 )  |  and  compara- 
llvely  low  modulation  frequencies  \  f,„  <  l/2(r|  +  r.  )|. 
the  largest  possible  value  of  r(/)  is 

r,„„  (Mirectangular)  =sinw(r,  +  r.l/wtr,  -i-  -,). 

(A4) 

Note  that  decreases  with  l/w. 

The  largest  possible  value  of  ril)  for  a  double-sided  ex¬ 
ponential  window  (Eq.  (6)  |  is 

r,„,„  (wjexponential)  =  ( ( I  +  oi'r  \  ) 

<(1  -i-w'',  ;:)]  '  (A5) 

The  respective  value  for  the  single-sided  exponential 
window  is  obtained  by  setting  r,  to  zero  in  Eq.  (AS).  Thus 
r,,,.,,  for  the  single-sided  exponential,  like  the  rectangular 
window,  decrea,ses  with  \/<o. 

By  inserting  Eq,  ( A4)  or  (AS)  into  Eq.  ( 1 ).  the  mini¬ 
mum  binaural  masked  threshold,  which  corresponds  to  the 
maximum  MLD,  can  be  calculated  for  Grantham  and 
Wightman’s  maskers.  These  predictions  are  plotted  in  Fig.  4 
versus  the  modulation  frequency /,„  <o/2it. 

'We  use  the  term  "monaural”  to  refer  to  conditions  where  performance 
ux)uld  not  he  expected  to  be  appreciably  altered  if  the  stimuli  had  been 
presented  to  only  one  ear.  Note  that  many  such  conditions  would  not  be 
truly  monaural-  Similarly,  we  refer  to  "binaural”  conditions  as  those  where 
performance  uxtutd  be  expected  to  be  appreciably  altered  if  the  signal  and 
ma.sking  stimuli  had  been  presented  to  only  one  ear-  Note  that  not  all  con¬ 
ditions  where  .stimuli  are  presented  to  both  ear^  meet  this  critenon- 
There  are  two  possible  ways  to  normalize  temporal  windows  First,  their 
integral  can  be  normalized  to  unity,  and.  second,  their  maximum  value  can 
be  normalized  to  unity  Our  approach  uses  the  hrst  method,  consistent 
with  Eq.  (2)  and  with  the  linear  amplitude  scale  on  which  the  window 
functions  were  defined.  The  second  method  would  he  appropriate  if  the 
window  functions  had  been  defined  on  a  logarithmic  amplitude  scale  with 
their  maximum  value  equal  to  0  dB  To  compare  different  window  func¬ 
tions  normalized  with  the  second  methixl.  the  "equivalent  rectangular  du¬ 
ration"  is  defined  as  specified  by  Moore  et  al.  ( That  is.  the  integral 
of  the  window  function  is  equated  to  the  integral  of  a  rectangular  window 
with  amplitude  1.0. 

For  our  window  functions  normalized  with  the  first  method,  the  equiva¬ 
lent  rectangular  duration  is  computed  by  first  scaling  the  maximum  value 
to  unity  and  then  calculating  the  integral  of  the  functions  Therefore,  the 
equivalent  rectangular  duration  is  given  by  the  inverse  of  the  temporal  win¬ 
dows  maximum  value.  With  a  double-sided  exponential  window,  for  exam¬ 
ple.  the  equivalent  rectangular  duration  is  simply  the  sum  of  r,  and  . 
'One  possible  explanation  for  the  2-dB  discrepancy  between  the  thresholds 
for  very  laige  positive  and  negative  values  of  r  and  the  respective  nontran¬ 


sient  thresholds  is  that  switching  the  interaural  phase  produces  a  transient 
auditory  sensation  that  is  reported  by  the  subjects  to  sometimes  sound  like 
(he  probe  tone.  The  perception  of  this  transient,  as  well  as  the  transient 
produced  by  a  sudden  increase  or  decrease  in  masker  level,  might  confuse 
the  subject  even  if  the  probe  tone  is  presented  considerably  before  or  after 
switching  the  masker. 

^Thc  normalized,  nonlinear  deviation  measure  B„,  was  computed  as 

B„,  =  I  -  2  ^  (t  -  Z)-'. 

where/.,  (i  —  I.-  .J)  are  the  y  threshold  values  obtained  at  the  time  delays 
/,  (  /  =  l.—V).  Lit, )  is  the  mixlel  functional  lime  /  .and  Lis  the  average  of 
all  J  ihres}u>]d  values.  Since  always  less  than  or  equal  to  one,  the 
model  function  that  his  the  data  best  yields  the  highest  value  of  B,., . 

'In  (he  binaural  conditions.  r(  /}  is  interpreted  as  the  average  interaural  cor¬ 
relation  coefficient  that  is  given  in  units  i>f  signal  power  and  ranges  between 
-  I  and  ).  For  an  analogous  description  of  the  monaural  condition,  the 
function  ft/)  replaces  r</)  such  that  {I  -  q{t)\  is  interpreted  a.s  average 
masker  power  and  ranges  between  0  and  2.  By  normalizing  1 1  --  |.  the 

resulting  expression  h(/)  =  [  I  -  9(1)  )/(Ar  -  I )  can  be  interpreted  a.s  the 
weighted  average  of  that  pan  of  the  masker  power  that  exceeds  the  -  1 5- 
dB  level  of  the  masker  Hence.  Eqs  ( I )  and  ( <) )  are  equivalent,  although 
their  parameters  would  have  different  mterpretaiions. 

''It  should  alv>  he  noted  that  the  data  of  Yost  ( ldH5)  appear  to  conflict 
with  (he  data  presented  here.  That  is.  ^'mt's  results  imply  (hat 
the  N  .  segment  of  the  noise  should  not  influence  the  detectability  of  a 
probe  lone  that  is  presented  m  the  N,  segment  of  the  noise  These  contra- 
dK'tions  are  considered  in  greater  detail  by  Gilkey  et  at.  ( I'W)).  wh^  :ried 
and  failed  to  replicate  the  results  of  Ymt  One  possibility  is  that  the  dilTer- 
cni  findings  are  due  to  short  gaps  before  switching  the  interaural  phase  in 
the  binaural  noise  stimuli  employed  by  Yost,  whereas  no  artifact  was  pres- 
em  m  the  stimuli  used  here  and  by  Ciikey  ft  a! 

'Alternatively,  btnh  hypothetical  types  of  binaural  sluggishness  can  he  de¬ 
fined  in  terms  of  an  interaural  delay  line  mcxlel  of  1  he  i>  pc  first  pniposed  by 
Jeffress  1 1*^48).  Binaural  channel  sluggishness  wimld  correspond  to  the 
attack  and  decay  rate  of  a  sound  image  located  at  a  certain  interaural  delay 
time;  whereas  binaural  analyzer  sluggishness  would  correspond  to  the  tem¬ 
poral  properties  of  the  binaural  analyzer  that  evaluates  the  patterns  along 
the  interaural  delay  line 
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Masker  fringe  and  binaural  detection 
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Yost  1 J.  Acoust.  Soc.  Am.  7*.  901-907  ( 1985)  1  found  that  the  detectability  of  a  30-ms 
dichotic  signal  (Sir)  in  a  30-ms  diotic  noise  (No)  was  not  affected  by  the  presence  of  a  500-ms 
dichotic  forward  fringe  (Nir).  Kollmeier  and  Gilkey  (J.  Acoust.  Soc.  Am.  87,  1709-1719, 

( 1990)  1  performed  a  somewhat  different  experiment  and  varied  the  onset  lime  of  a  25-ms  Sir 
signal  in  a  750-ms  noise  that  switched,  after  375-ms,  from  Ntr  to  No,  In  contrast  to  Yost,  they 
found  that  the  Nrr  segment  of  the  noise  reduced  the  detectability  of  the  signal  even  when  the 
signal  was  temporally  delayed  well  into  the  No  segment  of  the  noise  and  suggested  that  the  N  it 
segment  of  noise  acted  as  a  forward  masker.  To  resolve  this  apparent  conflict,  the  present  study 
investigated  the  detectability  of  a  brief  Sir  signal  in  the  presence  of  an  No  masker  of  the  same 
duration  as  the  signal.  The  masker  was  preceded  by  quiet  or  an  Nir  forward  fringe  and 
followed  by  quiet,  an  No,  or  Nn-  backward  fringe.  The  present  study  differs  from  most  previous 
studies  of  the  effects  of  the  masker  fringe  in  that  the  onset  time  of  the  signal  was  systematically 
varied  to  examine  how  masking  changes  during  the  time  course  of  the  complex  fringe-masker- 
fringe  stimulus.  The  results  failed  to  replicate  those  of  Yost  in  that  an  Ntr  forward  fringe 
reduced  the  detectability  of  the  signal,  and  agreed  with  those  of  Kollmeier  and  Gilkey  in  that 
thresholds  were  elevated  well  after  the  offset  of  the  Nir  segment  of  the  noise.  The  addition  of 
an  Nr  backward  fringe  was  also  shown  to  reduce  the  detectability  of  the  signal.  Possible 
reasons  for  differences  between  the  results  of  the  present  study  and  those  of  Yost  are  evaluated. 
Results  are  discussed  in  the  context  of  models  of  binaural  detection. 

PACS  numbers:  43.66.Dc,  43.66.Pn  [  WAY  J 
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IKTRODUCTION 

McFadden  (1966)  investigated  the  detectability  of  a 
low-frequency  (400-Hz),  125-ms,  sinusoidal  signal  in  the 
presence  of  wideband  masking  noise  that  was  either  contin¬ 
uous  or  that  was  pulsed  on  and  off  with  the  signal.  Under 
diotic  conditions,  thresholds  in  continuous  noise  were  slight¬ 
ly  lower  than  thresholds  in  pulsed  noise  (about  0.4  dB).' 
However,  under  dichotic  conditions,  the  difference  in 
thresholds  obtained  with  continuous  and  pulsed  maskers 
was  much  larger,  on  the  order  of  4-6  dB.  McFadden  also 
examined  conditions  in  which  the  masker  onset  occurred 
before  the  signal  onset,  but  their  offsets  were  simultaneous. 
He  referred  to  the  portion  of  the  masker  that  occurred  before 
the  signal  onset  as  a  forward  masker  "fringe."  Thresholds 
improved  as  the  duration  of  this  fringe  was  increased.  Per¬ 
formance  was  comparable  to  the  continuous  masker  condi¬ 
tion  when  the  duration  of  the  fringe  exceeded  600  ms. 

Robinson  and  Trahiotis  (1972)  considered  the  influ¬ 
ence  of  signal  duration  on  this  eflect.  Their  results  with  a 
261-ms  signal  replicated  those  of  McFadden.  With  a  37-ms 
signal  they  found  even  larger  diffeiences  between  pulsed  and 
continuous  maskers  (about  9  dB).^ 

Trahiotis  el  ai  ( 1972)  showed  that  the  addition  of  a 
backward  fringe  after  a  pulsed  masker  has  effects  similar  to, 
although  not  as  strong  as,  those  of  a  forward  fringe  (i.e.,  as 
the  duration  of  a  backward  fringe  is  increased,  performance 
approaches,  but  does  not  equal,  that  for  a  continuous  mask¬ 
er). 

The  differences  between  pulsed  and  continuous  maskers 
observed  under  dichotic  conditions  have  been  interpreted  in 


the  context  of  models  that  state  that  interaural  differences 
between  the  parameters  of  the  waveforms  presented  to  the 
two  ears  provide  the  basis  for  binaural  detection  (e  g.,  the 
vector  model  of  Jeffress,  1 972 ) .  For  example,  if  we  consider 
the  detection  of  an  interaurally  oul-of-phase  signal  (Sr)  in 
the  presence  of  an  interaurally  in-phase  noise  (No),  on 
noise-alone  trials  the  interaural  differences  aie  constant  and 
equal  to  zero,  while  on  signal-plus-noise  trials  there  are  on¬ 
going  fluctuations  in  the  interaural  parameters  (e.g.,  inter¬ 
aural  lime  differences)  during  the  lime  the  sigiutl  is  on  (the 
“ongoing”  cue).  If  the  noise  is  on  continuously,  there  is  also 
a  shift  in  interaural  parameters,  from  diotic  to  dichotic.  that 
occurs  at  the  onset  of  the  signal  (the  "onset”  cue).  The  ab¬ 
sence  of  the  onset  cue  with  pulsed  noise  maskers  is  used  to 
account  for  the  decrease  in  detectability  relative  to  contin¬ 
uous  noise  maskers.  Similarly,  because  the  addition  of  a  for¬ 
ward  fringe  reintroduces  the  onset  cue,  performance  com¬ 
parable  to  that  with  the  continuous  masker  is  predicted. 

Yost  (1985),  following  earlier  authors  (e.g.,  McFad¬ 
den,  1966;  Robinson  and  Trahiotis,  1972),  suggested  that 
the  presence  of  a  forward  and/or  backward  fringe  allows  the 
binaural  system  to  "establish  a  baseline”  against  which 
changes  in  interaural  parameters  resulting  from  the  signal 
onset  or  olftet  can  be  detected.  Because  the  binaural  system 
responds  slowly  to  changes  in  interaural  parameters,  detec¬ 
tion  is  facilitated  when  the  duration  of  the  masker  fringe 
increases,  providing  more  time  for  estimation  of  these  base¬ 
line  parameters.  Based  on  previous  Sndings  and  a  number  of 
additional  manipulations  performed  in  his  study,  Yost  con¬ 
cluded  that  the  presence  of  fringe  activity  anywhere  during 
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an  interval  approximately  300  to  SOO  ms  in  duration  immedi¬ 
ately  before  signal  onset  can  have  an  effect  on  the  measured 
threshold.  He  further  suggested  that  the  length  of  this  inter¬ 
val  corresponds  to  the  time  required  for  accurate  estimation 
of  interaural  parameters. 

Yost  also  found  that,  when  the  interaural  parameters  of 
the  fringe  were  different  from  those  of  the  masker  (e.g.,  an 
Ntt  fringe  preceding  an  No  masker),  thresholds  were  the 
same  as  with  a  pulsed  noise  masker  (i.e.,  the  fringe  had  no 
effect  on  detectability).  Yost  invoked  the  equalization-can¬ 
cellation  (E-C)  model  of  Durlach  ( 1972)  to  explain  why  a 
forward  fringe  with  the  same  interaural  parameters  as  the 
masker  yielded  lower  thresholds  than  a  forward  fringe  with 
interaural  parameters  different  from  those  of  the  masker.  He 
argued  that,  when  an  No  fringe  was  presented  before  an  No 
masker,  the  equalization  stage  of  the  model  would  adopt  a 
transformation  to  cancel  the  masker  fringe.  Such  a  strategy 
would  also  be  optimal  for  detecting  the  signal,  canceling  the 
masker  and  doubling  the  signal.  Because  it  takes  some  time 
for  the  equalization  stage  to  adopt  the  correct  transforma¬ 
tion,  the  longer  the  fringe  (up  to  SOO  ms),  the  lower  the 
measured  threshold.  When  an  Nir  fringe  preceded  an  No 
masker,  the  equalization  transformation  would  again  be 
chosen  to  cancel  the  fringe.  Because  it  takes  time  for  the 
equalization  stage  to  adopt  a  new  transformation,  the  No 
masker  would  not  be  canceled,  and  the  Sir  signal,  if  present, 
would  be  reduced.  Thus,  within  this  interpretation,  we 
would  expect  performance  to  be  much  worse  under  the  Nir 
forward  fringe  condition  than  for  a  continuous  No  masker 
condition  or  for  an  No  forward  fringe  condition. 

Yost's  interpretation  does  not  specifically  address  the 
pulsed  masker  condition.  However,  it  seems  unlikely  that  a 
quiet  period  preceding  an  No  masker  would  induce  the 
equalization  stage  to  adopt  such  a  nonoptimal  equalization 
transformation.  Thus  we  would  expect  the  threshold  for  the 
pulsed  noise  condition  to  be  lower  than  the  threshold  for  the 
Nir  forward  fringe  condition.  This  prediction  conflicts  with 
Yost's  findings. 

Kollmeier  and  Gilkey  ( 1990)  were  also  interested  in  the 
temporal  properties  of  the  binaural  system.  However,  their 
approach,  at  least  on  the  surface,  was  somewhat  different. 
They  wished  to  compare  the  time  course  of  temporal  mask¬ 
ing  under  monaural  and  binaural  conditions.  Kollmeier  and 
Gilkey  noted  that,  in  previous  studies  where  the  masker  was 
simply  switched  on  and  off,  the  results  were  difficult  to  inter¬ 
pret,  because  the  resultant  change  in  the  level  of  the  effective 
masker  influenced  not  only  the  binaural  channel,  but  also 
the  monaural  channel.  They  therefore  measured  the  detect¬ 
ability  of  a  brief  2S-ms  Sir  signal^  in  the  presence  of  a  7S0-ms 
noise  masker  whose  interaural  phase  was  switched  from  Nir 
to  No  after  37S  ms,  as  a  function  of  the  delay  between  the 
transition  in  the  noise  and  the  onset  of  the  signal  (At).  This 
masker  configuration  produces  a  change  in  the  effective  level 
of  the  masker  at  the  phase  transition  within  the  binaural 
channel,  but  not  within  the  monaural  channels.  The  curve  in 
Fig.  1  shows  the  results  for  this  Nir-No  masker  configura¬ 
tion.  There  is  a  gradual  transition  in  the  amount  of  masking, 
reminiscent  of  a  temporal  masking  function.  Note,  however, 
that,  when  &/  =  0  ( the  point  marked  by  the  F)  this  temporal 
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FIG.  1 .  Threshold  level  of  a  20'ms  Sv  signal  as  estimated  by  a  3'down  I  -up 
(79.4 percent  correct)  adaptive  staircase  procedure  plotted  as  a  function 
dr.  the  delay  between  the  phase  transition  in  the  noise  and  the  onset  of  the 
signal,  for  an  Nir-No  masker  configuration.  The  C  shows  the  threshold  in 
tlac  center  of  a  750-ms  Mo  masker.  The  vertical  line  corresponds  to  a  signal 
onset  temporally  aligned  with  the  transition  between  the  first  and  second 
segment  of  the  noise.  The  data  shown  are  the  average  thresholds  of  four 
suli^ts,  based  on  the  rcsultsof  Kollmeier  and  Gilkey  (1990). 


masking  configuration  is  quite  similar  to  some  of  the  masker 
fringe  conditions  investigated  by  previous  researchers.  That 
is,  there  is  both  an  Nir  forward  fringe  and  an  No  backward 
fringe.  The  C  at  A/  —  0  represents  threshold  with  a  “pseudo- 
continuous”  No  noise  (i.e.,  a  750-ms  No  masker  with  no 
phase  transition).  Thus  the  two  pointe  indicated  by  the  F 
and  the  C  are  very  similar  to  the  forward  fringe  condition 
and  the  continuous  masker  condition  investigated  by  Yost. 
The  10-dB  difference  between  the  Nir-No  masker  configu¬ 
ration  (point  F)and  the  continuous  masker  configuration 
(point  C)  at  Ar  =  0  is  comparable  to  the  difference  obtained 
between  pulsed  and  continuous  maskers  under  dichotic  con¬ 
ditions  in  other  studies  (e.g.,  Yost,  1985;  Robinson  and  Tra- 
hiotis,  1972).  Thus  it  might  be  suggested  that  the  threshold 
would  not  change  if  the  Nir  segment  of  the  Nir-No  masker 
were  deleted,  a  conclusion  in  agreement  with  Yost’s  finding 
that  the  Nir  forward  fringe  has  no  effect  on  the  measured 
threshold. 

A  more  careful  consideration  of  the  Kollmeier  and  Gil¬ 
key  data  indicates  that  the  Nir  segment  of  the  noise  is  having 
a  substantial  effect.  First,  it  should  be  recalled  that,  in  pre¬ 
vious  studies,  the  presence  of  an  No  backward  fringe  had 
been  found  to  improve  detectability  over  the  pulsed  masker 
condition  (e.g.,  T rahiotis  el  al.,  1 972 ) .  When  the  duration  of 
the  backward  fringe  is  similar  to  that  present  at  point  F,  the 
detectability  of  the  signal  should  only  be  about  3  dB  worse 
than  in  the  continuous  masker  condition.  Thus,  if  the  Nir 
segment  of  the  noise  really  had  no  effect  in  the  study  of  Koll¬ 
meier  and  Gilkey,  we  would  expect  the  difference  between 
point  F  and  point  C  to  have  been  around  3  dB,  rather  than 
the  approximately  10  dB  observed  in  Fig.  I.  Second,  the 
overall  shape  of  the  threshold  function  in  Fig.  1,  including 
the  continued  improvement  in  detectability  as  the  signal  is 
moved  farther  away  from  the  Nir  segment  of  the  noise  and 
into  the  No  segment  of  the  noise,  was  interpreted  by  Koll¬ 
meier  and  Gilkey  ( 1990)  to  indicate  that  the  Nir  segment 
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acts  like  a  forward  masker.  ( That  is,  its  effects  last  long  after 
its  offset. ) 

Kollmeier  and  Gilkey  also  interpreted  their  results 
within  the  context  of  the  E-C  model,  but  assumed  that  the 
equalization  stage  adopts  a  strategy  that  is  optimal  to  cancel 
the  No  segment  of  the  masker.  Therefore,  the  Ntr  segment  is 
effectively  doubled  by  the  cancellation  stage,  such  that  the 
level  of  the  masker  at  the  output  of  the  cancellation  stage  is 
quite  large  during  the  Na-  segment,  compared  to  the  level 
during  the  No  segment.  If  activity  in  the  binaural  channel 
does  not  decay  instantaneously  (e  g.,  see  Grantham  and 
Wightman,  1979),  activity  associated  with  the  Nir  segment 
of  the  noise  will  continue  even  after  the  phase  transition  in 
the  noise.  Kollmeier  and  Gilkey  argued  that  the  function 
shown  in  Fig.  I  describes  the  time  course  of  this  decay  of 
activity.  Although  Kollmeier  and  Gilkey  did  not  make  spe¬ 
cific  predictions  for  the  pulsed  masker  condition,  it  can  be 
seen  that,  while  the  masking  functions  for  an  No  masker 
with  and  without  an  N-  forward  fringe  might  intersect  at 
A/  =  0,  presumably  they  are  quite  different  functions  else¬ 
where. 

Kollmeier  (1986)  directly  replicated  Yost,  measuring 
the  detectability  of  a  brief  Str  signal  in  a  brief  No  masker.  He 
reported  the  difference  between  the  pulsed  masker  and  con¬ 
tinuous  masker  conditions  to  be  about  3.3  dB  and  the  differ¬ 
ence  between  the  Njt  forward  fringe  and  pulsed  masker  con¬ 
ditions  to  be  aim  os'  lOdB.  These  results  also  suggest  that  the 
Ntr  forward  fringe  is  having  a  substantial  effect.  ‘ 

To  resolve  these  apparent  differences  between  the  stud¬ 
ies  of  Yost  and  of  Kollmeier  and  Gilkey  ( 1990)  and  Koll¬ 
meier  ( 1986),  a  series  of  experiments  was  conducted  to  reex¬ 
amine  the  effects  of  forward  and  backward  masker  fringe  on 
dichotic  tone-in-noise  masking. 

I.  GENERAL  METHODS 

Both  signal  and  noise  stimuli  were  output  at  a  20-kHz 
sampling  rale  through  separate  1 6-bit  digital-to-analog  con¬ 
verters  and  passed  through  7.8-kHz  low-pass  antialiasing 
filters.  The  signal  and  noise  stimuli  were  added  with  an  ana¬ 
log  mixer  and  presented  via  TDH-49  headphones  mounted 
in  circumaural  cushions  (Grason-Stadler  model  OOIA)  to 
subjects  in  individual  sound-attenuating  booths.  Stimulus 
generation  and  presentation  and  response  collection  were 
controlled  by  an  SMS-KXX)  minicomputer  with  a  PDF 
1 1/73  processor. 

The  masking  stimulus  was  pseudorandom  noise  genera¬ 
ted  with  a  33-bit  software  shift  register  (Gilkey  el  al.,  1988) 
and  bandpass  filtered  between  1(X)  and  30(X)  Hz  by  a  Krohn- 
Hite  filter  (model  3730)  with  slopes  set  to  24  dB/oct.  The 
noise  was  turned  on  and  off  essentially  instantaneously. 
Thus  the  rise/fall  time  was  determined  by  the  filters  and  the 
headphones.  The  spectrum  level  of  the  noise  was  40  dB 
SPL/Hz.  The  phase  of  the  noise  in  the  left  ear  was  held 
constant,  while  the  phase  of  the  noise  in  the  right  ear  was 
changed  by  180*  for  Ntr  presentations.  It  should  be  noted 
that  the  generated  noise  can  be  at  either  of  only  two  instanta¬ 
neous  amplitudes.  Therefore,  instantaneous  changes  in  the 
interaural  phase  of  the  noise  do  not  introduce  transients  into 
the  monaural  waveform. 
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The  signal  was  a  computer-generated  SOO-Hz  sinusoid 
with  a  total  duration  of  20  ms  ( measured  from  the  beginning 
of  the  rise  to  the  end  of  the  fall)  and  was  shaped  with  5-ms 
linear  rise/fall  ramps.  It  was  always  presented  1 80°  out  of 
phase  interaurally  (Sir).  The  level  of  the  signal  was  con¬ 
trolled  by  separate  programmable  attenuators  for  each  sub¬ 
ject. 

Figure  2  shows  a  schematic  diagram  of  the  stimulus  con¬ 
ditions  examined  in  these  experiments.  The  noise  masker 
can  be  thought  of  as  being  divided  into  three  segments.  The 
first  and  last  segments  had  a  duration  of  330  ms,  while  the 
middle  segment  had  a  duration  of  20  ms.  The  middle  seg¬ 
ment  always  contained  No  noise.  The  first  and  third  seg¬ 
ments  could  contain  No  noise.  Ntt  noise,  or  quiet.  Threshold 
for  detection  of  the  signal  was  measured  as  a  function  of  the 
delay!  Ar)  from  the  beginning  of  the  middle  segment  of  the 
noise  to  the  onset  of  the  signal.  For  the  particular  case  when 
A;  =  0,  the  successive  panels  of  this  figure  show  stimulus 
configurations  analogous  to:  (a)  “pseudo-continuous" 
masker;  (b)  pulsed  masker;  (c)  N n- forward  fringe;  (d)  No 
backward  fringe;  (e)  Nir  forward  fringe  and  No  backward 
fringe;  (f)  Nir  backward  fringe;  and  (g)  Nn- forward  fringe 
and  Nir  backward  fringe  conditions.  For  convenience,  we 
often  refer  to  the  first  segment  of  the  noise  masker  as  the 
“forward  fringe”  and  the  third  segment  of  the  noise  masker 
as  the  “backward  fringe."  These  terms  are  correct  when 
At  =  0,  but  something  of  a  misnomer  when  A/  ^  0.  Note  that 
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FIG.  2.  Schematic  diagram  showing  the  seven  masker  configuratkms  mvca- 
tigated  in  this  paper:  (a)  pseudo-continuous  madter.  (b)  poised  masker; 
(c)  Nvforward  fringe;  (d)  No  backward  frmge;  (e)  Nrforward  fringe  and 
No  backward  fringe;  (f)  Nr  backward  fringe;  and  (g)  Nr  forward  fringe 
and  Nr  backward  fringe  conditions.  See  text  for  details. 
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the  stimulus  configurations  shown  in  panels  (a)-(c)  are 
analogous  to  those  studied  by  Yost,  and  the  stimulus  config¬ 
urations  shown  in  panels  (a)  and  (e)  are  analogous  to  those 
investigated  by  Kollmeier  and  Gilkey.  The  specific  condi¬ 
tions  tested  in  each  experiment  are  described  with  those  ex¬ 
periments. 

II.  EXPERIMENT  1— Ntr  FORWARD  FRINGE 

The  first  experiment  was  performed  to  determine  if  the 
ma.sking  functions  for  the  pulsed  masker  and  Nn-  forward 
fringe  conditions  did  indeed  intersect  when  A/  =  0,  or  if  the 
presence  of  the  Nf  forward  fringe  caused  a  further  elevation 
of  thresholds  above  those  observed  for  pulsed  maskers. 

A.  Methods 

1.  Subjects 

Four  subjects,  two  males  and  two  females  between  the 
ages  of  19  and  21,  were  paid  for  their  participation.  All  had 
clinically  normal  hearing  as  measured  by  audiometric  test¬ 
ing.  Subjeets  received  extensive  training  before  data  collec¬ 
tion  began. 

2.  Procedure 

A  two-alternative,  forced-choice  (2AFC)  procedure 
was  used  to  obtain  thresholds.  The  beginning  of  each  trial 
was  marked  by  a  60-ms  warning  light.  A  second  light  with  a 
duration  of  60  ms  was  turned  on  760  ms  after  the  offset  of  the 
warning  light  to  mark  the  first  observation  interval,  and  760 
ms  after  the  offset  of  this  light  a  third  light  was  turned  on  for 
60  ms,  marking  the  second  observation  interval.  The  signal, 
if  present,  was  turned  on  20  ms  after  the  light  marking  the 
appropriate  observation  interval.  The  760-ms  interval 
between  the  observation  intervals  was  set  such  that  the  offset 
of  the  masker  in  the  first  interval  and  the  onset  of  the  masker 
in  the  second  interval  would  be  separated  by  at  least  100  ms. 
Subjects  pressed  a  button  to  indicate  the  interval  containing 
the  signal.  Trials  were  self-paced,  and  subjects  received  visu¬ 
al  feedback  on  a  trial-by-trial  basis. 

The  typical  experimental  session  consisted  of  four  sets  of 
four  blocks.  Masker  configuration  and  At,  the  interval  from 
the  beginning  of  the  second  segment  of  the  noise  to  the  onset 
of  the  signal,  were  randomly  selected  across  blocks,  but  were 
not  varied  within  blocks.  Thresholds  for  each  block  were 
estimated  by  a  3-down,  1-up  adaptive  staircase  procedure 
(Levitt,  1971 ),  corresponding  to  a  target  percent  correct  of 
79.4.  Initial  signal  levels  were  set  at  approximately  IS  dB 
above  the  anticipated  threshold  and  varied  in  4-dB  steps  for 
the  first  two  reversals,  2-dB  steps  for  the  next  two  reversals, 
and  1  -dB  steps  thereafter.  A  block  was  terminated  after  all 
subjects  had  completed  a  minimum  of  ten  reversals.  Thresh¬ 
olds  were  estimated  by  discarding  the  first  four  reversals  and 
averaging  the  signal  level  at  the  remaining  reversals,  up  to 
the  highest  even  number  of  reversals. 

Three  stimulus  configurations  were  tested,  correspond¬ 
ing  to  the  first  three  configurations  shown  in  Fig.  2,  panels 
(a)-(c):  No-No-No  masker  configuration,  QUIET-No- 
QUIET  masker  configuration,  and  Ntr-No-QUIET  masker 
configuration.  For  the  QUIET-No-QUIET  condition  and 


the  N)r-No-QUIET  masker  configuration.  At  was  varied 
from  —  185  to  185  ms.  For  the  No-No-No  masker  configu¬ 
ration,  Af  was  always  equal  to  0.  Between  8  and  1 1  blocks  of 
trials  were  conducted  for  each  value  of  Ar  with  each  masker 
configuration. 


B.  Results  and  discussion 

Figure  3  shows  the  threshold  signal  level  for  each  sub¬ 
ject  under  each  of  the  three  masker  configurations  for 
A/  =  0,  corresponding  to  "pseudo-continuous"  masker, 
pulsed  masker,  and  Nn-  forward  fringe  conditions.  Consis¬ 
tent  with  previous  findings,  thresholds  for  the  pulsed  masker 
condition  are  3.7-12.0  dB  higher  than  thresholds  in  the 
pseudo-continuous  masker  condition.  However,  in  contrast 
to  the  findings  of  Yost  ( 1985),  thresholds  for  the  Nn-  for¬ 
ward  fringe  condition  are  2.3-10.5  dB  higher  than  thresh¬ 
olds  for  the  pulsed  masker  condition,  indicating  that  the  N- 
forward  fringe  has  a  substantial  effect  on  detectability.  These 
results  are  similar  to  those  reported  by  Kollmeier  ( 1986). 

Thresholds  as  a  function  of  A/  for  the  Nn--No-QUlET 
and  for  QUIET-No-QUIET  masker  configurations  are 
shown  in  Fig.  4.  Each  data  point  represents  the  average  of 
four  subjects.  The  C  indicates  the  pseudo-continuous  mask¬ 
er  condition  at  Ar  =  0.  It  can  be  seen,  as  suggested  in  Fig.  3. 
that  the  functions  do  not  intersect  at  Ar  =  0.  Further,  these 
functions  do  not  overlap  at  any  point,  and  it  is  clear  that  the 
effects  of  the  Nir  forward  fringe  extend  well  beyond  its  offset. 

The  results  of  experiment  1  failed  to  replicate  those  of 
Yost  ( 1985 )  and  suggest  instead  that  the  presence  of  an  Ntr 
forward  fringe  does  reduce  detectability.  Further,  the  fact 
that  the  effects  of  the  Nrr  forward  fringe  continue  long  after 
its  offset  is  compatible  with  the  interpretation  of  Kollmeier 
and  Gilkey  { 1990)  that  the  Nsr  segment  of  the  noise  acts  as  a 
forward  masker.  Note  that  in  their  study  an  No  backward 
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FIG.  3.  Threshold  level  of  a  20-ins  hir  signal  whose  onset  is  simultaneous 
with  the  beginning  of  the  second  segment  of  the  noise  for  each  a(  four  sub¬ 
jects  in  three  different  masker  conditions,  as  estimated  by  a  3<down  I -up 
(79.4  percent  correct)  adaptive  staircase  procedure.  The  pseudo-contin¬ 
uous  masker  is  a  720-ms  No  noise,  the  pulsed  masker  is  a  20-ms  No  noise, 
and  the  Nir  forward  fringe  iruLsker  is  a  20-ms  No  noise  preceded  by  a  3SO-ms 
Njt  forward  fringe. 
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FIG.  4.  Threshold  le\el  of  a  20-ms  Sr  signal,  as  eslimaiedhy  a  .Td<*»n  i*up 
( 79.4  perceni  eorreet )  adapt  i'c  staircase  prwedure.  ispU)iied  as  a  function 
of  A/,  the  lime  from  the  beginning  of  second  segment  of  the  imise  to  the 
onset  of  the  signal  for  the  N  r-No-QLt  1 BT  masker  configuration  and  the 
GL'IET-Nt>-QUIET  nvasker  configuration.  The  C  shows  the  threshold  in 
the  pseudo-continuous  masker  condition  at  Ar  --  0-  The  two  vertical  lines 
correspond  to  signals  w  hose  onsets  are  simultaneous  w  ith  the  beginning  and 
end  of  the  middle  segment  of  the  noise.  The  data  shown  are  averaged  across 
four  subjects. 


fringe  was  also  present,  and  may  have  influenced  the  ob¬ 
tained  results.  The  nest  experiment  provides  a  more  system¬ 
atic  investigation  of  the  effects  of  a  backward  masker  fringe. 

III.  EXPERIMENT  2— BACKWARD  FRINGE 

In  experiment  2,  the  Ntr  forward  fringe,  pulsed  masker, 
and  continuous  masker  conditions  were  examined  again  for 
a  different  set  of  subjects  and  compared  with  several  addi¬ 
tional  conditions,  including  the  QUIET-No-No  masker 
configuration  (Fig,  2,  panel  (d)],  which  at  A<  =  0isanNo 
backward  fringe  condition,  and  Ntr-No-No  masker  config¬ 
uration  (Fig,  2,  panel  (e)  ],  a  condition  similar  to  the  No- 
Nir  masker  configuration  of  KoUmeier  and  Gilkey  ( 1990). 
As  discussed  in  the  Introduction,  the  presence  of  an  No 
backward  fringe  has  also  been  shown  to  increase  the  detect¬ 
ability  of  the  signal  relative  to  the  pulsed  masker  condition 
(Trahiotis  et  al,  1972).  If  so,  under  the  QUIET-No-No 
condition  the  facilitative  influence  of  the  No  backward 
fringe  should  produce  a  threshold  at  Af  =  0  that  is  close  to 
the  value  for  the  continuous  masker  condition.  Further,  if 
the  Nir  forward  fringe  has  no  effect  on  detectability,  the 
threshold  for  the  N)r-No-No  masker  configuration  at 
At  =  0  should  be  the  same  as  that  for  the  QUIET-No-No 
masker  configuration.  On  the  other  hand,  if  the  Ntr  forward 
fringe  is  acting  as  a  forward  masker,  the  thresholds  for  Nw- 
No-No  masker  configuration  should  be  above  those  for  the 
QUIET-No-No  masker  configuration  at  At  =  0,  and  for 
other  values  of  Ar  as  well. 

The  effect  on  detectability  of  an  Ns-  backward  fringe 
was  also  measured  in  two  additional  conditions;  the  QUI- 
ET-No-Nir  masker  configuration  ( Fig.  2,  panel  ( f) ) ,  which 
at  A/  =  0  contains  a  backward  fringe  whose  interaural  pa¬ 


rameters  differ  from  the  preceding  noise;  and  (he  Ns--No- 
Ns-  masker  configuration  [Fig.  2,  panel  (g)],  which  in¬ 
cludes  both  a  forward  and  backward  Ns-  fringe.  Given  that 
an  No  forward  fringe  or  an  No  backward  fringe  increases 
detectability,  but  an  Ns  forward  fringe  decreases  detectabil¬ 
ity,  it  might  be  expected  that  an  Ns  backward  fringe  would 
also  decrease  detectability. 

A.  Method 

1.  Sub/ects 

Three  subjects  (two  males  and  one  female,  all  22  years 
of  age)  were  paid  for  their  participation.  None  had  partici¬ 
pated  in  experiment  1.  All  had  clinically  normal  hearing  as 
measured  by  audiometric  testing.  Subjects  received  exten¬ 
sive  training  on  the  task  before  data  collection  began, 

2.  Procedure 

General  procedural  details  are  similar  to  those  of  experi¬ 
ment  I .  All  of  the  conditions  shown  in  Fig.  2  were  examined 
in  this  experiment.  The  No-No-No  masker  configurations 
[Fig.  2(a)),  QUIET-No-QUIET  masker  configurations 
(Fig.  2(b)],  and  Ns-No-QUIET  masker  configurations 
[Fig.  2(c)]  were  tested  only  at  At  =  0;  the  remaining  mask¬ 
er  configurations  were  tested  at  values  of  A/ between  —  185 
and  185  ms.  Between  four  and  ten  blocks  of  trials  were  run 
for  each  masker  configuration  at  each  value  of  At. 

B.  Results  and  discussion 

Figure  5  plots  threshold  signal  levels  as  a  function  of  At. 
averaged  across  the  three  subjects  for  the  Nr-No-No  and 
QUlET-No-No  masker  configurations.  The  C,  P,  and  ir 
plotled  at  At  =  0  represent  thresholds  for  the  pseudo-contin¬ 
uous  masker  condition,  pulsed  masker  condition,  and  N” 
forward  fringe  condition,  respectively.  Thresholds  for  the 


FIG.  5  Threshold  level  ora20>fn&S]r  signal  is  plotted  asafunciion  of  A/  for 
the  QUIET-No-No  nuisker  configuration  and  the  Nir-No-No  ma-sker 
configuratkni.  The  C  P.  and  v  show  the  thresholds  in  the  pseudo-contin¬ 
uous  masker,  pulsed  ntasker.  and  N  v  forward  fringe  conditions  respective¬ 
ly.  al  A/  =  0.  The  data  shown  are  averaged  across  three  subjects.  Other  de¬ 
tails  are  »»  in  Fig.  4. 
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SIGNAL  ONSET  (ms) 

FIG.  6.  Threshold  level  of  a  20-ms  Srr  signal  is  plotted  asa  fur.-.’tion  of  A/  for 
the  QUIET-No-Nrr  masker  conftguration  and  the  Nn-No-Ntr  masker 
configuration.  The  C,  P,  and  it  show  the  thresholds  in  the  pseudo-contin¬ 
uous  masker  pulsed  masker,  and  Nir  forward  fringe  conditions,  respective¬ 
ly,  at  Ar  =  0.  The  data  shown  are  averaged  across  three  subjects.  Other  de¬ 
tails  are  as  in  Fig.  4. 

pseudo-continuous  masker  and  pulsed  masker  conditions 
differ  by  approximately  7.7  dB,  consistent  with  previous 
findings  and  with  the  results  of  experiment  1 .  Thresholds  for 
the  pulsed  masker  and  Njt  forward  fringe  conditions  differ 
by  about  4.7  dB,  also  in  agreement  with  experiment  I.  In 
addition,  it  can  be  seen  that  threshold  for  the  QUIET-No- 
No  masker  configuration  at  Af  =  0  (No  backward  fringe 
condition )  is  within  .1.0  dB  of  the  value  for  the  pseudo-con¬ 
tinuous  masker,  replicating  the  findings  of  Trahiotis  et  al. 
(1972). 

The  most  important  aspect  of  the  results  shown  in  Fig.  5 
is  the  fact  that  there  is  no  overlap  between  the  functions  for 
the  QUlET-No-No  and  Ntr-No-No  masker  configura¬ 
tions.  The  effect  of  the  Nir  forward  fringe  raises  thresholds 
well  above  those  for  the  QUIET-No-No  masker  configura¬ 
tion  for  all  values  of  At.  At  Af  =  0,  the  two  functions  differ 
by  approximately  8.8  dB.  Further,  the  function  for  the  Njr- 
No-No  masker  configuration  does  not  approach  the  func¬ 
tion  for  the  ((JUIET-No-No  masker  configuration  until  the 
signal  is  well  into  the  No  segment  of  the  noise. 

In  Fig.  6,  the  thresholds  for  the  QUIET-No-Nir  and 
the  Nir-No-Nn"  masker  configurations  are  plotted  as  a  func¬ 
tion  of  Af.  If  the  Nir  backward  fringe  had  no  effect  on  detect¬ 
ability,  it  would  be  expected  that  the  value  for  the  QUIET- 
No-N  jr  masker  configuration  at  Af  =  0  would  equal  that  for 
the  pulsed  masker  condition.  However,  the  Nir  backward 
fringe  raises  thresholds  above  the  pulsed  condition  by  ap¬ 
proximately  6,7  dB.  By  comparing  the  functions  for  the  Nir- 
No-Nir  masker  configuration  and  the  QUIET-No-Nir 
masker  configuration,  it  can  be  seen  that  there  is  a  detrimen¬ 
tal  effect  of  the  Nir  forward  fringe  when  Af  is  equal  to  zero 
( 3.0  dB ) .  For  values  of  Af  greater  than  zero  the  two  curves 
are  essentially  identical.  The  limited  effect  of  the  Nir  forward 
fringe  for  positive  values  of  Af  is  probably  a  ceiling  effect 
(i.e.,  thresholds  are  approximately  equal  to  the  expected 
monaural  threshold). 
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The  results  of  experiments  1  and  2  indicate  that  the  in¬ 
troduction  of  an  Nir  forward  fringe  before  an  No  masker 
elevates  thresholds  above  those  for  the  pulsed  masker  condi¬ 
tion.  This  finding  is  in  conflict  with  Yost’s  (1985)  result  that 
the  Nir  fringe  had  no  effect.  The  remaining  three  experi¬ 
ments  were  undertaken  in  an  attempt  to  reconcile  the  discre¬ 
pancies  between  the  results  of  the  present  study  and  those  of 
Yost. 

IV.  EXPERIMENT  3— OVERTRAINING 

Subjects  may  have  become  confused  by  the  number  of 
experimental  conditions  under  which  they  were  tested  in  ex¬ 
periments  1  and  2  ( Af  and  the  masker  configuration  changed 
randomly  from  block  to  block),  and  may  have  performed 
more  poorly  under  some  conditions  than  they  might  have  if 
fewer  conditions  had  been  tested.  It  is  also  possible  that  they 
may  have  found  the  phase  transition  in  the  noise  phenom¬ 
enologically  distracting  (the  perceptual  effect  of  this  shift  in 
phase  is  a  movement  of  the  auditory  image  within  the  head ) , 
and  that  this  distraction  affected  performance.  In  experi¬ 
ment  3,  the  possibility  that  these  factors  were  influencing  the 
results  was  investigated  by  “overtraining"  subjects  on  only  a 
few  conditions.  That  is,  ifthe  results  observed  in  experiments 
I  and  2  were  due  to  confusion  effects  or  attention  effects,  it 
should  be  possible  to  reduce  these  problems  by  overtraining 
the  subjects  and  limiting  the  number  of  conditions, 

A.  Method 
f.  Subjects 

The  subjects  from  experiment  I  participated  in  experi¬ 
ment  3.  All  subjects  received  22-24  blocks  of  additional 
training  in  the  specific  experimental  conditions  tested  in  ex¬ 
periment  3  before  data  collection  began. 

2.  Procedure 

General  procedural  details  were  similar  to  those  of  ex¬ 
periments  I  and  2,  The  pulsed  masker  condition  and  Nr 
forward  fringe  condition  were  investigated  with  At  =  0  for 
all  stimulus  presentations.  The  experimental  condition  was 
held  constant  within  sets  of  four  blocks  and  alternated 
between  sets  of  four  blocks.  Between  28  and  35  blocks  of 
trials  were  obtained  under  each  configuration. 

B.  Results  snd  discussion 

The  top  two  panels  of  Table  1  show  the  average  differ¬ 
ence  in  threshold  between  the  pulsed  masker  and  the  Nr 
forward  fringe  conditions  for  each  subject  in  experiments  I 
and  3.  As  can  be  seen,  the  average  values  for  the  two  experi¬ 
ments  differ  by  only  about  1 .0  dB,  indicating  that,  for  most 
subjects,  exposure  to  a  large  number  of  values  of  At  did  not 
substantially  affect  the  results  of  experiment  I. 

V.  EXPERIMENT  4— SINGLE-INTERVAL  TASK 

Yost  ( 1985)  assumes  that,  when  the  interaural  param¬ 
eters  of  the  fringe  are  the  same  as  those  of  the  masker,  the 
binaural  system  uses  the  fringe  in  order  to  estimate  the  inter¬ 
aural  parameters  of  the  masker.  He  further  assumes  that  the 
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TABLE  1.  Differences  in  threshold  between  pulsed  masker  and  Ntr  forward 
fringe  masker  conditions  at  At  =  0,  by  subject. 


Subject 

Experiment  1 
multiple  delays 

Difference  (pulsed-Nir fringe) 

CS 

6.4  dB 

SH 

10.5 

RH 

7.6 

JB 

2.3 

Avenge 

6.7 

Experiment  3 

overtraining  at  Ar  =  0 

Subject 

Difference  (pulsed-Ntr  fringe) 

CS 

5.9  dB 

SH 

7.5 

RH 

6.1 

JB 

3.1 

Avenge 

5.7 

Experiment  4 

single-interval  task 

Subject 

Difference  (pulsed-Ne- fringe) 

CS 

8.2  dB 

SH 

6.6 

RH 

5.9 

JB 

3.5 

Avenge 

6.1 

binaural  system  takes  a  considerable  period  of  time  in  order 
to  estimate  the  interaural  parameters  of  a  waveform.  A 
"fringe"  waveform  present  in  the  approximately  500-ms  in¬ 
terval  before  the  onset  of  the  signal  can  influence  the  ob¬ 
tained  thresholds.  Therefore,  it  may  be  inappropriate  to  esti¬ 
mate  dichotic  thresholds  using  techniques  such  as  multiple 
interval  forced-choice  procedures.  That  is,  the  masker  wave¬ 
form  in  one  interval  may  act  as  a  fringe  to  the  masker  in  the 
other  interval  unless  the  time  between  the  observation  inter¬ 
vals  is  quite  large.  Indeed,  under  the  NoSir  condition, 
McFadden  (196d)  observed  a  reduction  of  the  difference 
between  pulsed  and  continuous  maskers  when  a  2AFC  pro¬ 
cedure  was  used  rather  than  a  single-interval  procedure.  In 
his  study,  Yost  used  a  single-interval,  yes/no  procedure, 
whereas  experiments  1-3  of  the  present  study  employed  a 
2AFC  procedure.  Even  though  the  time  between  the  two 
observation  intervals  in  our  procedure  was  relatively  large 
( 760  ms) ,  it  might  be  argued  that  the  stimulus  in  one  interval 
may  have  influenced  the  processing  of  the  stimulus  in  the 
other  interval  and  thus  affected  the  results  (e.g.,  a  SOO-ms 
pause  between  the  observation  intervals  was  not  sufficient  to 
make  2AFC  and  single-interval  procedures  equivalent  in  the 
experiment  of  McFadden,  1966).  In  experiment  4,  the 
pulsed  masker  and  Njr  forward  fringe  conditions  were  reexa¬ 
mined  using  a  single-interval,  yes/no  procedure. 

A.M«thod 

1.Sub/tcl» 

The  four  subjects  from  experiments  1  and  3  participated 
in  experiment  4. 
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2.  Procedure 

General  procedural  details  were  similar  to  those  of  ex¬ 
periments  1-3,  except  as  described  below.  Thresholds  were 
measured  with  a  single-interval,  yes/no  procedure.  The  sig¬ 
nal  level  was  fixed  for  all  trials  within  a  block.  The  beginning 
of  each  trial  was  marked  with  a  60-ms  warning  light,  fol¬ 
lowed  after  760  ms  by  an  observation  interval  marked  by  a 
60-ms  light,  which  was  turned  on  20  ms  prior  to  the  onset  of 
the  signal,  if  present.  As  before,  trials  were  self-paced,  and 
visual  feedback  was  provided  on  a  trial-by-trial  basis. 

Experimental  sessions  were  arranged  in  sets  of  three 
100-trial  blocks.  Each  set  was  preceded  by  a  20-trial  practice 
block.  Within  each  set  of  blocks,  the  experimental  condition 
was  held  constant  and  three  signal  levels  were  tested  on 
successive  blocks.  The  signal  levels  differed  in  2.S-dB  steps, 
with  the  middle  value  corresponding  to  approximately  75% 
correct  performance  for  that  subject.  Values  of  d '  were  cal¬ 
culated  for  each  subject  for  each  block  and  averaged  across 
blocks.  Three-point  psychometric  functions  were  fit  as 
straight  lines  to  the  logarithmic  transform  of 

d'  =  OT(E/No)‘,  (1) 

as  described  by  Egan  et  al.  ( 1969 ) .  Threshold  was  estimated 
as  the  level  corresponding  to  </'=I.16  [FCOjafc 
=  0.794). 

The  two  conditions  tested  in  experiment  3,  QUIET- 
No-QUIET  masker  configuration  [Fig.  2(b)]  and  Nir- 
No-QUIET  configuration  [Fig.  2(c)  1  were  investigated 
with  At  =  0  ( the  pulsed  masker  and  Nir  forward  fringe  con¬ 
ditions).  At  least  six  blocks  of  trials  at  each  of  the  three 
signal  levels  were  obtained  for  each  condition.'* 

B.  Results  and  discussion 

Table  I  shows  the  difference  in  threshold  between  the 
pulsed  masker  and  Nir  forward  fringe  conditions  for  each 
subject.  The  middle  panel  shows  values  from  experiment  3 
(2AFC  procedure),  and  the  bottom  panel  shows  values 
from  experiment  4  ( single-interval  procedure ) .  The  average 
difference  between  the  conditions  with  the  2AFC  procedure 
in  experiment  3  was  5.7  dB,  whereas  the  average  difference 
between  the  conditions  with  the  single-interval  procedure  in 
experiment  4  was  6. 1  dB.  While  this  slight  effect  of  the  psy¬ 
chophysical  procedure  is  in  the  same  direction  as  the  1 .9  dB 
effect  reported  by  McFadden,  it  seems  safe  to  conclude  that 
the  use  of  a  2AFC  procedure  in  experiments  1-3  did  not 
significantly  influence  the  results. 

VI.  EXPERIMENT  5— SIGNAL  DURATION 

In  Yost’s  ( 1985)  study,  the  signal  duration  was  30  ms. 
as  measured  from  the  beginning  of  the  rise  to  the  end  of  the 
fall,  whereas  in  experiments  1-4  the  signal  duration  was  20 
ms.  Robinson  and  Trahiotis  ( 1972),  in  measuring  the  influ¬ 
ence  of  signal  duration  on  forward  fringe  effects,  found  con¬ 
siderable  differences  in  results  obtained  for  short  ( 37-ms) 
and  long  (261-ms)  signals.'  Specifically,  they  found  a  much 
larger  difference  between  continuous  and  pulsed  maskers 
under  dichotic  conditions  when  the  signal  duration  was 
short.  The  seemingly  related  “overshoot  effect"  is  also  de- 
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pendent  on  signal  duration  {e.g.,  Zwicker.  1965;  Fasti, 
1976).  However,  the  overshoot  effect  s'hen  measured  for 
dichotic  conditions  is  usually  quite  smail  (e.g.,  Trahiotis  el 
a!.,  1972;  McFadden,  1988).  Nevertheless,  it  is  possible  that 
the  shorter  duration  of  the  signal  used  in  the  present  study 
led  to  a  larger  difference  between  puksed  masker  and  Ntr 
forward  fringe  conditions  than  would  have  been  found  with 
a  longer  duration  signal.  This  possibility  was  examined  more 
closely  in  experiment  5. 

A.  Method 

1.  Subjects 

Four  subjects  ( two  males  and  two  females  between  the 
ages  of  19  and  22  years)  were  paid  for  their  participation  in 
the  experiment.  All  had  clinically  normal  hearing  as  mea¬ 
sured  by  audiometric  testing.  Two  subjects  ( JB  and  CS)  had 
participated  in  experiments  1,  3,  and  4.  All  subjects  were 
extensively  trained  on  the  experimental  task  before  data  col¬ 
lection  began. 

2.  Procedure 

Procedural  details  were  the  same  as  in  experiment  4, 
with  the  exception  that  signal  durations  of  both  20  and  30 
ms,  as  measured  from  the  beginning  of  the  rise  to  the  end  of 
the  fall,  were  used.  The  QUIET-No-QUIET  and  Ntr-No- 
QUIET  masker  configurations  with  Ar  =  Owere  tested  (i.e., 
pulsed  masker  and  Nir  forward  fringe  conditions).^  Between 
13  and  27  blocks  of  trials  for  each  signal  duration  and  level 
under  each  condition  were  obtained  for  each  subject.'' 

B.  Results  and  discussion 

Table  II  shows  the  difference  between  the  pulsed  masker 
and  Nit  forward  fringe  conditions  for  each  of  the  four  sub¬ 
jects.  As  can  be  seen,  there  is  little  difference  in  the  size  of  the 
effect  obtained  with  the  20-  and  30-ms  signals,  with  an  aver¬ 
age  difference  for  the  30-ms  signal  of  3. 1  dB  and  an  average 
difference  for  the  20-ms  signal  of  4.2  dB. 


TABLE  11.  Differences  in  threshold  between  pulsed  masker  and  Nir  for¬ 
ward  fringe  masker  conditions  at  At  =  0,  by  subject. 


Subject 

Experiment  S 

30-ms  signals 

Difference  (pulsed-Ntr  fringe) 

CS 

3.7  dB 

KR 

3.1 

RQ 

2.6 

JB 

2.9 

Average 

3.1 

Experiment  5 

20-ms  signals 

Subject 

Difference  (pulsed-Nrr  fringe) 

CS 

6.6  dB 

KR 

3.2 

RO 

4.3 

JB 

2.4 

Average 

4.2 
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Although  the  size  of  the  effect  is  somewhat  smaller  for 
the  longer  duration  signal,  the  presence  of  the  Nrr  forward 
fringe  still  appreciably  reduces  detectability  for  all  four  sub¬ 
jects.  Thus  it  seems  unlikely  that  the  use  of  a  20-ms  rather 
than  30-ms  signal  in  experiments  1-4  substantially  altered 
the  results. 

VII.  GENERAL  DISCUSSION 

Yost  ( 1985)  found  that,  if  the  interaural  parameters  of 
the  forward  fringe  differed  from  the  interaural  parameters  of 
the  portion  of  the  masker  that  overlapped  with  the  signal,  the 
presence  of  the  fringe  had  no  effect  on  detectability.  In  con¬ 
trast,  Kollmeier  and  Gilkey  ( 1990)  argued  that  an  Nir  seg¬ 
ment  of  noise  preceding  an  No  masker  acted  like  a  forward 
masker. 

The  results  of  experiment  1  of  the  present  study  failed  to 
replicate  those  of  Yost  ( 1985 ) .  That  is,  when  At  is  equal  to 
zero,  thresholds  in  the  Nir  forward  fringe  condition  are  6.7 
dB  higher  than  in  the  pulsed  masker  condition.  Moreover, 
thresholds  for  the  Nir-No-QUIET  masker  configuration 
were  above  those  for  the  QUIET-No-QUIET  masker  con¬ 
figuration  for  all  values  of  At,  demonstrating  that  the  Nir 
segment  of  the  noise  has  effects  that  last  long  after  its  offset. 
In  experiment  2,  these  effects  were  reexamined  in  conditions 
where  an  No  backward  fringe  was  also  present.  Again, 
thresholds  were  about  8.8  dB  higher  when  the  Nir  forward 
fringe  was  added.  Also,  the  function  that  related  threshold  to 
At  for  the  Nir-No-No  masker  configuration  was  always 
above  that  for  the  QUIET-No-No  masker  configuration, 
again  suggesting  that  the  Nir  segment  of  the  noise  has  a  long- 
lasting  effect.  Further,  the  shape  of  the  Nir-No-No  function 
resembled  that  of  a  forward  masking  function  and  was  simi¬ 
lar  to  the  function  obtained  by  Kollmeier  and  Gilkey  ( 1990) 
for  a  similar  condition.  The  addition  of  an  Nir  backward 
fringe  was  also  shown  to  raise  threshold  at  Ar  ==  0  over  that 
for  the  pulsed  masker  condition.  Experiments  3-5  indicate 
that  the  difference  between  the  results  of  the  present  study 
and  those  of  Yost  cannot  be  readily  explained  by  training 
effects,  the  psychophysical  procedures  employed,  or  the  du¬ 
ration  of  the  signal.^ 

Based  on  the  previous  literature,  it  appears  that,  under 
dichotic  presentations,  the  addition  of  a  forward  or  back¬ 
ward  masker  fringe  that  has  the  same  interaural  parameters 
as  those  of  the  masker  will  enhance  the  detectability  of  the 
signal.  On  the  other  hand,  the  results  of  the  present  study,  as 
well  as  those  of  Kollmeier  and  Gilkey  ( 1990)  and  Kollmeier 
( 1986),  indicate  that  the  addition  of  a  forward  or  backward 
fringe  whose  interaural  parameters  are  different  from  those 
of  the  masker  will  decrease  the  detectability  of  the  signal. 

There  are  at  least  three  possible  approaches  to  explain¬ 
ing  these  results.  First,  a  phase  transition  in  the  noise  pro¬ 
duces  a  distinct  sensation,  which,  to  a  first  approximation, 
can  be  described  as  "movement"  of  the  auditory  image.  Per¬ 
haps  the  subject's  ability  to  detect  the  signal  when  it  is  close 
to  the  transition  in  the  noise  is  hampered  because  he  is  dis¬ 
tracted  by  this  change  in  the  auditory  image.  As  the  signal  is 
moved  away  from  the  transition  in  the  noise,  the  subject  is 
better  able  to  focus  his  attention  on  the  signal.  One  problem 
with  this  argument  is  that  it  provides  no  specific  explanation 
for  the  difference  between  the  pulsed  masker  and  continuous 
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masker  conditions,  except  perhaps  to  assume  that  the  onset 
of  the  masker  is  also  distracting,  although  less  distracting 
than  the  phase  transition  in  the  noise.  A  second  problem 
with  this  argument  is  that  we  might  expect  the  distracting 
effects  of  the  transition  in  the  noise  to  raise  thresholds  during 
the  Ntt  segment  of  the  masker  as  well.  However,  instead  of 
getting  worse,  thresholds  actually  seem  to  improve  slightly 
as  the  signal  is  brought  closer  to  the  transition  in  the  noise 
within  the  Ntr  segment.  Indeed,  when  the  signal  offset  and 
the  transition  are  simultaneous,  thresholds  appear  to  be  at 
least  slightly  lower  than  when  the  signal  is  well  within  the 
Njt  segment  of  the  noise. 

The  second  explanation  assumes,  as  did  Kollmeier  and 
Gilkey,  that  the  Nir  forward  fringe  acts  as  a  forward  masker. 
If  we  assume  an  E-C  model  (Durlach,  1972)  that  selects  an 
equalization  transformation  that  maximizes  the  Sir  signal  at 
the  output  of  the  cancellation  stage,  then  the  output  will  be 
large  during  the  Nir  segment  of  the  noise  ( i.e.,  the  noise  will 
be  doubled  in  amplitude)  and  small  during  the  No  segment 
of  the  noise  (i.e.,  the  noise  will  be  canceled  to  the  limit  deter¬ 
mined  by  the  internal  noise).  Further,  if  activity  in  the  bin¬ 
aural  channel  decays  slowly,  then  activity  associated  with 
the  Nff-  segment  of  the  noise  might  continue  after  the  transi¬ 
tion  in  the  noise  and  act  as  a  forward  masker.  Thus  the  grad¬ 
ual  increase  in  detectability  as  the  signal  is  moved  from  the 
N>r  segment  to  the  No  segment  in  Fig.  3  might  represent  the 
time  course  of  temporal  masking.  One  problem  with  this 
interpretation  is  that  it  does  not  specifically  address  the  dif¬ 
ference  between  the  No  forward  fringe  condition  and  the 
pulsed  masker  condition,  but  only  states  that  the  Nir  for¬ 
ward  fringe  condition  should  be  worse  than  either. 

The  third  explanation  assumes,  as  did  McFadden 
( 1966),  Robinson  and  Trahiotis  ( 1972),  and  Yost  ( 1985), 
that  the  masker  fringe  provides  a  reference  or  baseline 
against  which  the  signal  is  detected.  For  example,  in  an 
NoSir  detection  task  an  No  forward  fringe  provides  a  refer¬ 
ence  of  diotic  cues.  On  noise-alone  trials,  the  stimulus  re¬ 
mains  diotic  throughout,  while  on  signal-plus-noise  trials 
there  arc  ongoing  dichotic  cues  during  the  time  the  signal  is 
on  (the  ongoing  cue),  and  there  is  a  transition  in  interaural 
parameters  from  diotic  to  dichotic  at  the  signal  onset  (the 
onset  cue).  When  no  fringe  is  present  (the  pulsed  masker 
condition),  the  onset  cue  is  not  available,  and  the  subject 
must  rely  on  the  ongoing  cue  alone.  An  Nir  fringe,  on  the 
other  hand,  would  provide  a  reference  of  dichotic  cues.  On 
noise-alone  trials,  the  transition  would  be  from  dichotic  in¬ 
teraural  cues  to  diotic  interaural  cues,  and  on  signal-plus- 
noise  trials  the  transition  would  be  from  dichotic  interaural 
cues  to  dichotic  interaural  cues  of  reduced  average  magni¬ 
tude.  It  could  well  be  argued  that  this  condition  would  pro¬ 
vide  a  less  effective  onset  cue  than  is  provided  in  the  No 
forward  fringe  condition,  but  probably  no  less  effective  than 
the  nonexistent  onset  cue  in  the  pulsed  masker  condition. 
However,  while  in  the  No  forward  fringe  condition  and  the 
pulsed  masker  condition  there  are  never  dichotic  cues  on 
noise-alone  trials,  under  the  Ntr  forward  fringe  condition 
there  would  be  interaural  difference  cues  present  on  both 
noise-alone  and  signal-plus-noise  trials.  Thus  it  could  be  ar¬ 
gued  that  the  ongoing  cue  is  less  reliable  than  under  either 


the  No  forward  fringe  condition  or  the  pulsed  masker  condi¬ 
tion.  If  so,  one  might  expect  performance  to  be  best  for  the 
No  forward  fringe  condition  ( both  onset  and  ongoing  cues ) , 
and  worst  for  the  Nrr  forward  fringe  condition  (poor  onset 
cue  and  poor  ongoing  cue).  Thus  this  explanation  is  consis¬ 
tent  with  the  pattern  of  results  observed  here. 

As  suggested  by  Yost,  a  similar  argument  can  be  made 
in  the  context  of  the  E-C  model.  If  we  assume  that  the  equal¬ 
ization  stage  of  the  model  adopts  an  equalization  transfor¬ 
mation  that  attempts  to  minimize  the  noise  at  each  instant  in 
time.’  then  under  the  No  forward  fringe  condition,  the  sub¬ 
ject  would  choose  an  equalization  transformation  that  maxi¬ 
mally  canceled  the  No  fringe.  Given  that  this  equalization 
transformation  is  also  optimal  for  detecting  the  signal,  per¬ 
formance  will  be  good.  Under  the  Nir  forward  fringe  condi¬ 
tion,  the  subject  would  choose  an  equalization  transfonna- 
tion  that  maximally  cancels  the  Ntr  fringe.  Such  a 
transformation  would  double  the  No  portion  of  the  masker 
and  cancel  the  signal,  a  very  ineffective  strategy  for  detecting 
the  signal.  If  we  assume  that  the  system  can  change  its  equal¬ 
ization  transformation  only  slowly,  then  this  transformation 
will  still  be  in  effect  during  the  No  segment  of  the  noise,  and 
performance  will  be  poor.  Although  it  is  unclear  what  equal¬ 
ization  transformation  the  model  would  adopt  in  the  quiet 
before  the  onset  of  a  pulsed  masker,  it  is  not  unreasonable  to 
assume  that  the  system  adopts  some  random  transforma¬ 
tion,  which,  on  average,  is  more  effective  than  the  transfor¬ 
mation  adopted  in  the  Nir  forward  fringe  condition,  but  less 
effective  than  the  transformation  adopted  in  the  No  forward 
fringe  condition.  Thus  this  argument  predicts  the  ordering 
of  the  three  conditions,  if  not  the  quantitative  details.  Within 
this  view,  the  gradual  decrease  in  threshold  after  the  Nir 
segment  of  the  noise  indicates  the  time  required  to  establish 
the  correct  equalization  transformation. 

In  summary,  it  is  not  clear  why  the  results  obtained  in 
the  present  study  differ  from  those  of  Yost  ( 1985)  The  ele¬ 
vation  of  thresholds  resulting  from  addition  of  an  Nn-  fringe 
appears  to  be  robust  (all  nine  subjects  who  participated  in 
the  experiments  reported  here  show  the  effect ) .  However,  it 
should  be  noted  that  there  was  some  intersubject  variability 
in  the  size  of  the  effect.  Overall,  investigations  into  the  effects 
of  adding  forward  and  backward  masker  fringes  suggest  that 
the  binaural  system  is  sensitive  to  activity  occurring  during  a 
considerable  period  before  and  after  the  presentation  of  a 
signal,  and  that  any  such  activity  may  have  long-lasting  ef¬ 
fects.  None  of  the  models  presented  here  or  in  the  literature 
provides  a  very  quantitative  description  of  the  data.  How¬ 
ever,  a  model  which  assumes  that  the  masker  fringe  provides 
a  baseline  or  reference  of  interaural  information  is  not  incon¬ 
sistent  with  the  data  presented  here. 
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‘  This  result  agrees  with  most  later  studies  using  moderate>  to  long-dura* 
tion.  narrow-band,  low-frequency  signals  masked  by  broadband  maskers 
(Tucker  et  al.,  1968;  Robinson  and  Trahiotis,  1972;  Wier  et  at,  1977; 
Kohlrausch,  1986).  However,  substantial  effects  are  sometimes  observed 
when  the  signal  duration  is  reduced,  the  masker  bandwidth  is  narrowed,  or 
the  signal  frequency  is  raised.  These  effects,  when  observed  under  dioctc 
conditions,  may  be  related  to  the  overshoot  effect  (Zwicker,  1965;  Fasti. 
1976;  Carlyon.  1987).  However,  while  substantial  differences  exist 
between  burst  and  continuous  masking  for  dichotic  conditions,  dichotic 
overshoot  is  usually  small  (Trahiotis  e/d/..  1972;  McFadden.  1988). 

*  The  durations  shown  here  are  the  total  duration  of  the  signal  ( i.e.,  from  the 
beginning  of  the  rise  to  the  end  of  the  fall),  and  thus  are  different  from  the 
original  durations  reported  by  the  authors. 

'Bell  ( 1972)  measured  the  detectafcHlitycff  an  Srsignal  in  a  masker  (hat  was 
switched  from  NU  ( interaurally  uncorrelaied  noise)  to  No  within  a  trial. 
The  signal  was  always  presented  during  the  No  portion  of  the  noise,  but  the 
duration  of  the  No  fringe  either  before  or  aAer  the  signal  was  varied.  He 
found  that  performance  was  worse  when  no  No  fringe  was  present  and 
suggested  that  this  case  was  comparable  to  the  pulsed  rnasker  condition. 
However,  becau.se  he  did  not  actu^ly  measure  thresholds  under  the  pulsed 
masker  condition,  it  is  not  possible  to  determine  whether  an  SU  forward 
fringe  affects  detectability. 

*  For  a  few  blocks,  at  the  highest  signal  levels,  the  obtained  response  matri¬ 
ces  had  empty  cells.  The  data  for  these  blocks  were  discarded. 

'  When  the  duration  of  the  signal  was  30  ms,  the  duration  of  the  No  segment 
of  the  noise  was  increased  to  30  ms,  and  the  duration  of  the  Nir  forward 
fringe  was  decreased  to  345  ms. 

^Recently,  Yost  (1988)  has  pointed  out  another  potentially  significant  dif¬ 
ference  in  the  stimuli  between  the  two  studies.  Ilie  transition  between  the 
fringe  and  masker  segments  of  our  generated  noise  ( i.e..  before  filtering) 
was  essentially  instantaneous  and  produced  no  change  in  intensity  within 
the  monaural  channels.  In  Yost’s  ( 1985)  study  the  transition  between  the 
fringe  and  the  masker  was  more  gradual.  The  fringe  ended  with  a  5-ms 
linear  decay  followed  by  an  approximately  1  -ms  gap  and  then  by  the  S-m$ 
linear  onset  of  the  masker  or  masker  plus  signal.  Hence,  any  phase  transi¬ 
tion  occurred  over  an  U-ms  interval.  In  addition,  there  was  a  gap  of  7  ms 
( if  measured  from  the  half  power  points )  between  the  fringe  and  the  mask¬ 
er.  The  exact  implications  of  th^  stimulus  differences  are  unclear,  but 
there  is  at  least  some  possibility  that  they  may  be  resptmsible  for  the  ob¬ 
served  differences-  For  example,  the  gap  may  be  significant  in  light  of  the 
work  of  HaAer  and  Buell  ( 1983),  indicating  that  a  gap  in  a  sequence  of 
stimuli  appeared  to  have  the  effect  of  '’restarting"  the  binaural  system. 
Thus  it  is  at  least  possible  that  the  gap  between  the  fringe  and  the  masker 
may  have  allowed  the  binaural  analyzer  to  escape  (he  influence  of  the  Ns- 
fringe.  However,  in  (his  case  it  is  not  clear  why  there  is  still  a  facilitating 
effect  of  the  No  fringe. 

^The  data  of  McFadden  ( 1967)  would  indicate  (hat  subjects  can  do  some¬ 
thing  equivalent  to  adopting  different  equalizatkm  tranafonnations  on  dif¬ 
ferent  trials.  He  investigated  detectability  under  (he  NoSo  condition  and 
the  NirSo  condition  and  found  that  subjects  did  no  worse  when  the  two 
types  of  (rials  were  randomly  mixed  within  the  same  block  than  when  the 
two  types  d  (riab  were  presented  in  separate  Mocks.  UnpubKshed  data 
from  our  laboratory  indkate  that  even  when  sul^ects  are  overtrained  to 
Inten  for  NoSo  or  NoSw  trials,  their  perfonnanoe  is  not  significantly  bam- 
pered  on  unexpected  probe  trials  containing  (be  other  ctmdition.  These 
results  allow  for  the  posaibtlity  (hat  the  subject  can  alter  his  processing 
strategies  within  a  trisJ.  On  the  odier  hand,  a  more  reasonable  interpreta¬ 
tion  mi^t  be  to  assume  that  there  are  multifde  E-C  ebanneia,  each  tuned 
to  a  particular  equalization  transformation.  In  this  case,  the  question  of 
how  rapidly  the  equalization  transformation  can  be  changed  within  a 


channel  would  be  restated  as  how  the  binaural  analyzer  chooses  the  appro¬ 
priate  channel  to  monitor,  and  how  rapidly  that  choice  can  be  made. 
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